BEST AVAILABLE COPY 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 




PCX 

INTERNATIONAL APPUCATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classiflcation 7 ; 

C12Q yes, A61K 3y00, A61P 43/00 



A2 



(11) International Publication Number: 



WO 00/52210 



(43) International Publication Date: 8 September 2000 (08.09.00) 



(21) International Application Number: 



pcmjsoa^0527i 



(22) International Filing Date: 



29 February 2000 (29.02.00) 



(30) Priority Data: 
60/122,199 



1 March 1999 (01.03.99) 



US 



(71) Applicant: VARIAGENICS, INC. [US/US]; 60 Hampshire 

Street, Cambridge, MA 02139 (US), 

(72) Inventors: STANTON, Vincent, P., Jr.; 32 Royal Road, 

Belmont. MA 02178 (US). BASILION, James. P.; 27 
Stetson Street #1, Brookline, MA 02446 (US). SHEN, Ling; 
Apartment 8, 2 Royal Crest Drive, Marlboro, MA 01752 
(US). VERDINE, Gregory; 7 Bennington Road, Uxington, 
MA 02421 (US). 

(74) Agents: WARBURG, Richai^, J. et al.; Suite 1200, 550 West 
C Street, San Diego, CA 92101 (US). 



(54) Titfe: METHODS FOR TARGETING RNA MOLECULES 



a 



AARS-1013U/CmRNA 
U-allele &*allele 



-SI 4 SI 




(81) Designated States: AU, CA, JP. European patent (AT, BE, 
CH, CY, DE. DK, ES. FI, FR, GB, GR, IE. IT, LU, MC, 
NL, PT, SE). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



RPA70-1674U/C mRNA 
U-allele C-allele 



SI ♦SI .T| 



-SI «fSI ♦Tt 




(57) Abstract 

• ^^"^^^ are methods for identifying targets in RNA molecules based on differences in RNA secondary structure related to seauence 
t^^i.^l!TJf^' ^"^'^ omis. Also described are methods for identifying or developing small molecules which in^^ 
to modulate a taiget RNA by creation of ptotein-small-moleculc-RNA complexes, as well as compositions which include such s^ 
molecules and methods of using those small molecules and compositions. i~ » niui inciuae sucn small 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCX on the front pages of pamphlets publishing international applications under the PCT. 



AL 


AflMmia 


ES 


Spain 


LS 


AM 


Armenia 


FI 


Finland 


LT 


AT 


Austria 


FR 


France 


LU 


AU 


Australia 


GA 


Gabon 


LV 


AZ 


Azeitaijan 


GB 


United Kingdon 


MC 


BA 


Bosnia and Herzegovina 


GE 


Geoigia 


MD 


BB 


Baibados 


GH 


Ghana 


MG 


BE 


Belgium 


GN 


Guinea 


MK 


BP 


Burkina Faso 


GR 


Greece 


BG 


Bulgaria 


HU 


Hungary 


ML 


BJ 


Benin 


IE 


Ireland 


MN 


BR 


Brazil 


IL 


Israel 


MR 


BY 


Belarus 


IS 


Iceland 


MW 


CA 


Canada 


IT 


Italy 


MX 


CF 


Central African Republic 


JP 


Japan 


NE 


CG 


Congo 


KE 


Kenya 


NL 


CH 


Switzerland 


KG 


Kyigyzstan 


NO 


a 


CAte d'lvoire 


KP 


Democratic People's 


NZ 


CM 


Cameroon 




Rqmblic of Kotea 


PL 


CN 


China 


KR 


Republic of Korea 


PT 


cu 


Cuba 


KZ 


Kazakstan 


RO 


cz 


Czech Republic 


LC 


Saint Lucia 


RU 


DE 


Germany 


U 


Liechtensteio 


SD 


DK 


Denmark 


LK 


Sri Lanka 


SE 


BB 


Estonia 


LR 


Liberia 


SG 



Lesotho 


SI 


Slovenia 


Lithuania 


SK 


Slovakia 


Luxembourg 


SN 


Senegal 


Latvia 


sz 


Swaziland 


MOTUCO 


TD 


Chad 


Rqmblic of Moldova 


TG 


Togo 


Madagascar 


TJ 


Tajikistan 


The former Yugoslav 


TM 


Turkmenistan 


Republic of Macedonia 


TR 


Ttiricey 


Mali 


TT 


Trinidad and Tobago 


Mongolia 


UA 


Ukraine 


Mauritania 


UG 


Uganda 


Malawi 


US 


United States of America 


Mexico 


uz 


Uzbekistan 


Niger 


VN 


Viet Nam 


Netherlands 


YU 


Yugoslavia 


Norway 


ZW 


Zimbabwe 



New Zealand 
Poland 
Portugal 
Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 



wo 00/52210 



PCTAJSOO/05271 



DESCRIPTIQN 



10 



METHODS FOR TARGETING RNA MOLECULES 



RELATED APPLICATION 
This application claims the benefit of Shen et aL. U.S. Provisional 
Application No. 60/122,199, filed March 1, 1999, entitled METHODS FOR 
TARGETING RNA MOLECULES, which is hereby incorporated by reference in its 
entirety, including drawings. 



BACKGROUND 

This application relates to the field of identification of compounds active on 
RNA and of target sites on the RNA. 

The wealth of information gained firom the study of the human genome has 

15 allowed identification of many genes whose inappropriate expression results in 
disease. To exploit this information, several technologies have been developed to 
specifically target gene expression. Modulating the expression of such genes allows 
treatment of the disease. In particular, the strategy of modulating gene expression 
by targeting RNA has spawned several new classes of rationally designed 

20 therapeutics (e.g., ribozymes, DNAzymes, and antisense oligonucleotides). 

However, to date, these technologies have had very limited clinical success. One of 
the reasons for the limited success is due to the diflFicult pharmacological barriers 
these macromolecules must overcome. 

The pharmacological treatment of disease is one of the signal advances in 

25 medicine in the twentieth century. Modem methods of small molecule development 
have also proven highly useful in allied fields such as veterinary medicine and pest 
eradication in agriculture. An ever increasing number of diseases can be 
ameliorated by appropriate drug therapy, reflecting increasing knowledge of disease 
pathophysiology and improved methods for conducting clinical research. As the 

30 sequence of the human (and other) genomes is solved and the molecular 

understanding of disease advances it will increasingly be possible to select optimal 
targets for therapeutic attack. Thus while many current drug treatments act by 
poorly understood mechanisms, and/or provide only symptomatic relief, 
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increasingly it should be possible to reverse disease pathophysiology and provide 
more specific and potent therapeutic relief. To achieve this goal, however, will 
require improved methods of drug development. In particular, the development of 
drug targeting strategies that enable highly specific targeting of selected genes or 
5 gene products will be of great use. 

The information provided and references cited herein are provided solely to 
assist the understanding of the reader, and is not admitted to be prior art to the 
present invention. 

10 SUMMARY OF THE INVENTION 

The present invention provides certain advantageous methods for developing 
biologically active molecules. The invention includes a new method for targeting 
RNA with small molecules, where the small molecule makes contact with both a 
cellular protein and a cellular RNA. The effect of the tripartite complex is to 

1 5 increase the affinity of the complex beyond what could be achieved alone with the 
small molecule and the RNA, This addresses what is believed to be the major 
limitation of current attempts to target RNA with small molecules, the lack of 
specific high affinity binding of small molecules to RNAs, which are much more 
flexible than proteins. This is a generic method for inhibiting any cellular RNA, as 

20 described in the disclosure, including, but not limited to allele specific inhibition or 
inhibition based on targetiiig to identified regions of RNA secondary structure where 
that secondary structure differs between allelic forms. 

The invention also provides a method for identifying RNA targets for allele 
specific therapy. These methods have application in at least two major areas: (1) 

25 autosomal dominant diseases, or diseases of excess gene dosage, where elimination 
of one mRNA (but not both) would have a therapeutic effect and (2) cancer and 
other diseases with LOH where allele specific therapy will provide a differential 
effect on disease tissue compared to normal tissue. Such use of allele specific 
therapy has been described in Housman, U.S. Patent No. 5,702,890, issued 12/30/97 

30 and Housman et al., U.S. Patent Application No. 09/045,053, filed March 19, 1998 
which are hereby incorporated by reference in their entireties including drawings. 
Targeting of different alleles can, for example, involve the use of protein-small 
molelcule complexes as described below, but is not limited to such complexes. For 
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example, different alleles can be targeted with small molecules or with 
oligonucleotide inhibitors. 

The present invention provides a unique strategy for targeting cellular RNAs. 
This new method overcomes the limitations of current approaches with respect to 
5 affinity and specificity. The invention entails formation of a trimolecular complex 
involving the target RNA, a small molecule, and a cellular protein. The recruitment 
of a specific cellular protein to the RNAismall molecule complex has the advantage 
of substantially increasing the area of binding due to contacts between the protein 
and the RNA, and thereby increase the affinity of the complex beyond what could be 
10 achieved alone with the small molecule and the RNA by cooperative binding 

generally involving electrostatic, hydrogen bonding, and hydrophobic interactions. 
These favorable binding interactions will increase the affinity of the complex 
beyond what could be achieved with a small molecule and RNA. 

The present invention also provides a new class of RNA therapeutic targets 
15 that will provide the basis for inhibition, e.g., allele specific inhibition, by small 
molecules, and methods for identifying such targets in candidate RNAs. The basis 
of the invention is the novel observation that single nucleotide polymorphisms in 
mRNAs are associated with altered secondary structures. These structural 
differences, not previously known to be associated with normal allelic variation, 
20 provide scope for allele specific small molecule inhibition of RNA. Also described 
are methods for producing and identifying small molecule inhibitors of RNAs. 

Thus, in a first aspect the invention provides a method for identifying a 
potential small molecule inhibitor, where the inhibition involves the induction of 
formation of a complex of a cellular RNA, a small molecule inhibitor, and a cellular 
25 protein. The potential small molecule inhibitor includes a first and a second moiety 
joined by a linker. The method involves identifying a first moiety which binds to at 
least a portion of a selected cellular protein; and also identifying a second moiety 
which binds to at least a fi^agment of a cellular RNA. Such joint binding indicates 
that the small molecule is a said potential small molecule inhibitor. The inhibitory 
30 ability of the small molecule can be tested by standard methods, e.g., testing the 
level of protein synthesis fi^om a target mRNA. Clearly, the identifications of the 
moieties can be performed in either order. The method can also involve selection of 
a hnker or provision of a standard linker, e.g., based on prior screening. In certain 
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cases the linker need not be separate from the first or second moiety, but may be a 
part of one or both moieties. 

In this context, the term "potential small molecule inhibitor" refers to a small 
molecule which binds to both a selected protein and a target RNA but for which the 
5 ability to inhibit a biological activity (e.g., reduce the rale or amount of protein 
translation from an mRNA) of the RNA has not yet been tested. Following 
confirmation of such inhibitory characteristic, the small molecule can be referred to 
as a "small molecule inhibitor*' or "inhibitor". 

The term "small molecule" refers to a compound which has a molecular 
10 mass equal to or less than 5000 Daltons (5 kD), preferably less than 3 kD, still more 
preferably less than 2 kD, and most preferably less than 1 kD. In some cases it is 
highly preferred that a small molecule have a molecular mass equal to or less than 
700 Da. 

The term "inhibitor" refers to a compound that reduces to a statistically 

15 significant extent at least one activity of a reference compound. In the context of 
this invention, an inhibitor generally reduces an activity of an RNA, most 
conunonly, but not limited to, reducing or eliminating the rate and/or extent of 
translation of an mRNA, reducing or eliminating processing of a pre-mRNA to 
produce a mature RNA; reducing or eliminating transport of an mRNA from nucleus 

20 to cytoplasm, or reducing or eliminating a regulatory function of an RNA molecule. 
Preferably the reduction in activity is sufficient to detect based on gross observation 
of cellular properties, e.g., growth rate, survival, morphology, and is preferably 
sufficient and of a nature to produce a therapeutic response in a patient in whom 
reduction in the expression of the target gene would be beneficial. 

25 In the context of this invention, the phrase "induction of formation of a 

complex" refers to a property of a small molecule which results in a structure which 
includes multiple molecules, especially a protein-small molecule-RNA complex, 
where the components other than the small molecule would not associate with each 
other or not associate with each other in the same physical relationship to any 

30 appreciable extent in the absence of the small molecule or other component which 
induces formation of the complex. In some cases other molecules may be present in 
the complex. 
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In the context of this invention, the term "trimolecular complex" refers to an 
association or complex which includes a selected protein, a small molecule, and a 
target RNA. Other components may also be present, e.g., additional proteins or 
polypeptides, small ions, or water. Preferably, however, the selected protein and the 
5 target RNA are the only macromolecules present in the complex. Preferably the 
stably associated complex consists essentially of or consists of the specified types of 
molecules. 

In preferred embodiments, the identifying of the first moiety involves 
identifying a known small molecule ligand of the cellular protein. By utilizing a 
10 known small molecule ligand (including analogs which retain binding to the 

protein), it is not necessary to search for a binding moiety, but only to confirm that 
binding occxirs to the selected protein when the ligand is attached to the linker and 
the second moiety. 

In preferred embodiments, the method involves providing a small molecule 
1 5 library containing compounds comprising or consisting essentially of the first 

moiety and the linker attached to variable second moieties, and screening the library 
to identify one or more molecules such that the second moiety binds to the target 
RNA. Preferably the identifying a second moiety involves providing a small 
molecule library in which compounds have a first moiety which binds to said 
20 selected cellular protein, a linker, and a variable second moiety; and screening 

compounds of the library to identify a compound which binds to the target RNA or 
fi'agment thereof. The fi-agment includes sufficient sequence to retain the 
characteristics utilized for binding of such molecules to the intact RNA, e.g., local 
secondary structure. 

25 In reference to selected proteins and target RNA for binding of a first moiety 

and a second moiety respectively, the terms "fi-agment" and "portion" refer to a part 
of the sequence of the intact protein or RNA which retains properties of the 
molecule important for selecting binding. Preferably the fragment or portion retains 
the local secondary structure as in the intact molecule. For proteins, a portion is 

30 preferably at least 10 amino acid residues in length, more preferably at least 20, and 
still more preferably at least 40 amino acid residues. RNA fiagments are preferably 
at least 20 nucleotides in length, more preferably at least 50 nucleotides in length, 
and still more preferably at least 100 nucleotides in length. 
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Reference to "variable" moieties or linkers in a plurality of compounds or 
molecules or in a library of such compounds means that different molecules in the 
group have different moieties or linkers as specified. 

Conversely, the term "fixed" in connection with a moiety^pr linker in a 
5 plurality of compounds (including in a library, e.g., a combinatorial library) means 
that compounds in the plurality have the same specified moiety or linker. 

In preferred embodiments the method involves identifying the first moiety by 
providing a small molecule library of compounds which have a variable first moiety, 
a linker, and a fixed second moiety, and screening compounds of the library to 
10 identify a compound which binds to the selected cellular protein or portion thereof. 

In cases where a knovm ligand for a selected protein is used, preferably the 
protein and its ligand are selected firom adenosine triphosphate analogs, which could 
sequester one or more cellular ATPases; flavin adenine dinucleotide or FAD 
analogs; nicotinamide or nicotinamide analogs; folates or folate analogs, which 
1 5 could sequester proteins such as dihydrofolate reductase, thymidylate synthase or 
other abundant proteins which require folate as a cofactor; cyclosporin, FK506, 
rapamycin, and coumermycin or their synthetic analogs, which bind to FK506 
Binding Protein (FKBP). 

In preferred embodiments, the process of identifying the first or second 
20 moieties or the linker involves two or more iterations of at least one of the 

identification steps. This can be performed to improve the characteristics of the 
small molecule, e.g., the binding affinity for either or both of the selected protein or 
target RNA or to improve overall complex stability. 

In preferred embodiments, the protein constituent of the trimolecular 
25 complex is a cellular protein that is present in the cellular compartment(s) in which 
the target RNA is present. 

Preferably the protein constituent of the trimolecular complex is a protein 
with known RNA binding activity. Such selection can be advantageous as it is 
aheady known that the protein has properties which allow stable interaction with 
30 RNA, an interaction which can be utilized in the formation of the desired complex. 

Preferably the protein constituent of the trimolecular complex is an RNase., 
for example an RNase fi-om one of the classes RNaseH type 1 , RNaseH type 2, and 
RNaseL; or an RNA helicase. 
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In preferred embodiments the protein constituent of the trimoiecular complex 
is ah DNA binding protein. Such molecules also provide advantageous nucleic acid 
binding properties. 

In preferred embodiments, the protein constituent of the trimoiecular 
5 complex has a two or more basic residues within 1 0 Angstroms of the small 

molecule binding site thereby providing favorable charge interactions with the target 
RNA. 

As indicated, the molecules are small molecules, which preferably do not 
exceed 4000 Daltons in molecular mass, more preferably do not exceed 3000 

1 0 Daltons, still more preferably do not exceed 2000 Daltons, and still more preferably 
do not exceed 1 000 Daltons, or even 700 Daltons. 

In preferred embodiments, the targeted cellular RNA is a pre-mRNA or an 
mRNA. The term "mRNA" is used in its usual sense as understood by those skilled 
in the art. The term "pre-mRNA" refers to an RNA transcript which will undergo 

1 5 processing to become an mRNA. Conmionly the pre-mRNA contains additional 
RNA sequence which will be removed to form the mature mRNA. 

In preferred embodiments the targeted cellular RNA has at least one 
polymorphism that results in an altered secondary structure and the altered 
secondary structure is the target of the RNA binding moiety of said small molecule. 

20 In connection with an RNA, a "sequence polymorphism" or **polymorphism" 

is a difference in nucleotide sequence between RNAs transcribed from different 
allelic forms of a particular gene. Most often, a polymorphism is a single nucleotide 
substitution. A polymorphism, either alone or in conjunction with one or more other 
polymorphisms can result in a change in RNA secondary structure, e.g., as shown by 

25 changes in digestion with various RNases. 

Preferably the targeted cellular RNA is an RNA bearing a mutation that 
alters secondary structure of the RNA, or that exists on the same allele as another 
site of variation that alters the secondary structure of the RNA. 

The term "allele" refers to one specific form of a gene within a cell or within 

30 a population, the specific form differing from other forms of the same gene in the 
sequence of at least one, and frequently more than one, variant sites within the 
sequence of the gene. The sequences at these variant sites that differ between 
different alleles are termed "variances", "polymorphisms", or "mutations". The 
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term "alternative allele", "alternative form", or "allelic form" refers to an allele that 
can be distinguished from other alleles by having distinct variances at at least one, 
and frequently more than one, variant site within the gene sequence. 

In preferred embodiments, the method further involves providing a plurality 
5 of small molecules which contain a plurality of different linkers and a first moiety 
identified as described above, or a second moiety identified as described above or 
both; and screening the plurality of small molecules to identify a potential small 
molecule inhibitor with advantageous binding or pharmacologic characteristics. The 
further step of varying the linker can allow selection of a linker which allows more 
1 0 beneficial presentation or association of the protein and the RNA. 

In preferred embodiments, the identification of the second moiety involves 
providing a small molecule library containing compounds containing fixed first 

ft 

moiety and linker attached to variable second moieties; and screening the library to 
identify one or more molecules in which the second moiety binds to the target RNA. 
15 Preferably the identification of the first moiety involves providing a small molecule 
library of compounds which contain a variable first moiety, a linker, and a fixed 
second moiety identified as just described; and screening compoimds of the hbrary 
to identify a compound which binds to the selected cellular protein or portion 
thereof. 

20 In a second aspect, the invention provides a method for identifying an RNA 

target for an allele specific therapeutic small molecule, e.g., an inhibitor, by 
identifying at least one sequence variance in the RNA target in a population of 
interest, determining whether the RNA secondary structure of any sequence variants 
identified differs between variant alleles. A difference in the RNA secondary 

25 structure is indicative that the sequence variance is a potential target for an allele 
specific therapeutic small molecule. 

An "allele specific inhibitor" or "variance specific inhibitor" is a drug or 
inhibitor that inhibits the activity of one alternative allele of a gene to a greater 
degree than at least one other alternative allele. The difference in activity is 

30 coimnonly determined by the dose or level of a drug required to achieve a 

quantitative degree of inhibition. A commonly used measure of activity is the IC50 
or concentration of the drug required to achieve a 50% reduction in the measured 
activity of the target gene. 
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Preferably an allele specific inhibitor will have at least twice the activity on 
the target allelic form than on a non-target allelic form, more preferably at least 5 
times, still more preferably at least 10 times, and still more preferably at least 50 
times, and most preferably at least 1 00 times. This can also be expressed as the 
5 sensitivities of the different allelic forms to the inhibitor. Thus, for example, it is 
equivalent to state that the target allelic form is most preferably at least 100 times as 
sensitive to the inhibitor as a non-target allelic form. The activity of an inhibitor can 
be measured either in vitro or in vivoy in assay systems that reconstitute the in vivo 
system, or in systems incorporating selected elements of the complete biological 
1 0 system. For use in inhibiting cells containing only the target allelic form rather than 
cells containing at least one non-targeted allelic form, the difference in activity is 
preferably sufficient to reduce the proliferation rate or survival rate of the cells 
having only the targeted allelic form to no more than one half of the proliferation 
rate or survival rate of cells having at least one non-targeted allelic form. More 
15 preferably, the fraction is no more than 1/5 or 1/10, and still more preferably no 
more than 1/20, 1/50, 1/100, or even lower. 

In this context, the term "population'' refers to a group of individuals 
(generally a group of humans) distinguishable from other groups. Bases for 
distinguishing can, for example, include geography, familial relationship, racial 
20 background, age, gender, and combinations of these factors. A "population of 
interest" is thus a group of individuals which are relevant to identification of a 
target, a therapeutic or complex-forming agent, or a method of using or making such 
agents or other aspect of this invention, e.g., a group to which a patient belongs, in 
most cases a population will preferably encompass at least ten thousand, one 
25 himdred thousand, one million, ten million, or more individuals, with the larger 
numbers being more preferable. In special circumstances, diseases will occur with 
high fi-equency in specific geographical regions or within specific familial, racial, or 
cultural groups, and a relevant population may usefully be considered to be a smaller 
group. 

30 Preferably, the sequence variance to be targeted occurs in an allele fi-equency 

range between 0.1:0.9 and 0.5:0.5 in a population of interest. 

The term "allele fi-equency" refers to the fi-action (or fi-equency of 
occurrence) of a specific allele as compared to all alleles in a population. It is 
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recognized in the art that the heterozygote frequency and allele frequency are related 
and, for certain alleles, can be described by Hardy Weinberg equilibrium 
calculations. It will also be recognized that sequence variances that occur at high 
frequency in the population are commonly not deleterious to the health of the 
5 individuals who carry these genes and are commonly not disease genes or mutations 
that are associated with disease. 

In preferred embodiments, the determination of the RNA secondary structure 
is performed using one or more structure-specific enzymes, e.g., a nuclease, 
preferably selected from Tl, T2, SI, U2, CL3, VI, A, PhyM, N.c. nuclease or 

1 0 RNase; or by chemical probing with a chemical selected from the group consisting 
of dimethyl sulfate, diethylpyrocarbonate, CMCT, kethoxal, bisulfite, 
ethyhiitrosourea, MPE-Fe(II), and Fe(II).EDTA. 

Allele specific targeting can advantageously be applied in a case where the 
target RNA is an allele that, if reduced in abundance, would at least amehorate a 

15 disease state. The amelioration can, for example, be symptomatic relief, reduction 
or elimination of one or more deleterious effects of the disease, or even cure of the 
disease. 

Likewise, the target RNA can be an allele associated with an autosomal 
dominant disease, such as Huntington's disease, or other diseases shown in Table 1. 

20 The target RNA can also be an allele of an essential gene which undergoes 

loss of heterozyosity in a proliferative disorder, e.g., a cancer. For example, the 
targets identified by the methods herein can advantageously be utilized in the 
therapeutic approach described in Housman, supra. 

The term "proliferative disorder" refers to various cancers and disorders 

25 characterized by abnormal growth of somatic cells leading to an abnormal mass of 
tissue which exhibits abnormal proliferation, and consequently, the growth of which 
exceeds and is uncoordinated with that of the normal tissues. The abnormal mass of 
cells is referred to as a **tumor", where the term tumor can include both localized 
cell masses and dispersed cells. The term "cancer" refers to a neoplastic growth and 

30 is synonymous with the terms "malignancy", or "malignant tumor". The treatment of 
cancers and the identification of anticancer agents is the concern of certain preferred 
embodiments of the aspects of the present invention. Other abnormal proliferative 
diseases include "nonmalignant tumors", and "dysplastic" conditions including, but 
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not limited to, leiomyomas, endometriosis, benign prostate hypertrophy, 
atherosclerotic plagues, and dysplastic epithelium of lung, breast, cervix, or other 
tissues. Drugs used in treating cancer and other non-cancer proliferative disorders 
commonly aim to inhibit the proliferation of cells and are commonly referred to as 
5 antiproliferative agents. 

In preferred embodiments, the allele specific small molecule is a small 
molecule inhibitor containing a first moiety which binds to a selected cellular 
protein or a portion thereof; a second moiety which binds to a target cellular RNA or 
a fragment thereof, and a linker connecting the first moiety and the second moiety, 
10 where the small molecule inhibitor can bind to the selected cellular protein and to 
the target cellular RNA to form a complex. The molecule can be as further 
described for those identified in the first aspect above and for corresponding 
pharmaceutical compositions and complex-forming molecules. 

In a third aspect, the invention provides a method for identifying an allele 
15 specific RNA binding small molecule for a target RNA, involving the steps of 
identifying at least one sequence variance in the RNA target in a population of 
interest having secondary structure differences between variant alleles, screening the 
two RNA structures for a said sequence variance or variances for binding to or 
inhibition by a plurality of small molecules; and identifying a compound that 
preferentially binds to or inhibits one of said RNA structures, where the binding or 
inhibition is mdicative that the compound is an allele specific RNA binding small 
molecule. Preferably the plurality of small molecules are elements of a small 
molecule combinatorial library. 

Preferably the method includes determining whether the allele specific RNA 
binding small molecule provides a cellular activity, by contacting cells producing 
the target RNA with the allele specific RNA binding small molecule; and 
determining whether the small molecule produces a change in cellular activity. 
Thus, the method uses a cellular assay to identify cellular effects. Preferably the 
cellular effect or activity is a decrease in expression of the target RNA or the 
translation product. 

As in preceding aspects, the target RNA is preferably a pre-mRNA or an 
mRNA. 
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In preferred embodiments, the plurality of small molecules is a plurality of 
potential small molecule inhibitors each comprising a first moiety which may be the 
same or different and a second moiety which differs between the pluraUty of small 
molecules and a linker joining the first moiety and the second moiety. The first 
5 moiety is selected to bind to a selected cellular protein or a portion thereof; and the 
second moiety is selected to bind to a target cellular RNA or a fi-agment thereof, 
thereby forming a trimolecular complex. 

Additional preferred embodiments are as described for the first aspect above. 
In a fourth aspect, the invention provides a pharmaceutical composition 
10 which includes a small molecule inhibitor where the inhibitor includes a first moiety 
which binds to a selected cellular protein or a portion thereof; a second moiety 
which binds to a target cellular RNA or a fi-agment thereof, and a linker connecting 
said the moiety and the second moiety, where the small molecule inhibitor can bind 
to the selected cellular protein and to the target cellular RNA to form a complex. 
1 5 The composition also includes a pharmaceutically acceptable carrier, excipient, or 
diluent. Generally the inhibitor will be a synthetic molecule. 

In the context of the inhibitor or binding compounds of this invention, the 
term '^synthetic" indicates that the compound has been prepared in whole or in part 
by the intervention of humans, and is not a compound naturally produced by a 
naturally occurring organism. A portion or even most of a synthetic molecule can be 
produced by an organism, and then modified by synthetic methods. Alternatively, 
most or all of a molecule can be prepared by synthetic methods, and the resulting 
molecule is then referred to as "fiilly synthetic". 

In certain cases, a naturally occurring molecule may be found which has the 
complex-forming properties described. Such a molecule could be used in the 
methods described herein for using compounds which inhibit or bind RNA, or could 
be modified to alter its binding and/or activity characteristics. 

Preferably the selected protein is a himian protein; preferably the target RNA 
is a human RNA. 

Preferably the small molecule inhibitor inhibits expression of said target 

RNA. 

In preferred embodiments the small molecule inhibitor has differential 
activity on variant forms of the RNA differing in secondary structure, where the 
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difference in secondary structure is related to at least one sequence variance. 
Preferably the targeted cellular RNA has a polymorphism that results in an altered 
secondary structure and the altered secondary structure is the target of the RNA 
binding moiety of said small molecule. 
5 Also in preferred embodiments, the inhibitor is a compound as identified in 

the first aspect above and/or the target is as identified in the second aspect. 

In a fifth aspect, the invention provides a method for inhibiting expression of 
an mRNA. The method involves contacting cells normally expressing the mRNA 
with a small molecule inhibitor. The inhibitor has a first moiety which binds to a 
1 0 selected cellular protein or a portion thereof; a second moiety which binds to a target 
cellular mRNA, and a linker connecting the first moiety and the second moiety. The 
small molecule inhibitor can bind to the selected cellular protein and to the target 
cellular mRNA to form a complex. 

In preferred embodiments, the molecule and/or target is as described for 
1 5 embodiments of aspects above. 

In a sixth aspect, the invention provides a method for treating an autosomal 
dominant or gene dosage disease, by administering to a patient suffering from such a 
disease a therapeutically effective amount of an allele specific RNA inhibitor 
preferentially active on an allehc form of a target RNA in which at least one 
20 sequence variance provides a difference in secondary structure and wherein the 
allele specificity involves that difference in secondary structure. 

In preferred embodiments, the disease is an autosomal dominant disease, and 
the patient is heterozygous for the gene corresponding to the target RNA. 

Also in preferred embodiments, the disease is a gene dosage disease, and 
25 said administering is adjusted to reduce but not eliminate expression of the gene. 
Preferably the patient is heterozygous for the gene and the inhibitor preferentially 
reduces or eliminates expression of only one allele. 

The term "gene dosage disease'* refers to a disease which is at least partially 
has a cause based on overactivity of a product of a gene, e.g., overexpression of the 
30 gene. 

A "therapeutic effect" results, to some extent, in a measurable response in 
the treated disease or condition. Thus, a therapeutic effect can, for example, include 
a cure, or a lessening of the growth rate or size of a lesion such as a tumor, or an 
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increase in the survival time of treated patients compared to controls, a relief of a 
deleterious symptom, or a slowing of the progression of a disease, among other 
possible effects. 

The term "therapeutic amount" means an amount which, when administered 
5 appropriately to a patient e.g., a mammal, e.^., a human, suffering from a disease or 
condition, produces a therapeutic effect. 

In preferred embodiments, the molecule and/or target is as described for 
aspects and embodiments above. 

In a seventh aspect, the invention provides a method of treating a patient 
10 suffering from a disease or condition, by administering to the patient an allele 

specific inhibitor of a target RNA where the allele specificity involves a difference 
in secondary structure related to at least one sequence variance. 

In preferred embodiments, the method involves determining the presence of 
the at least one sequence variance in the patient. 
15 In preferred embodiments, the inhibitor and/or the target is as described for 

aspects or embodiments above. 

In an eighth aspect, the invention provides a method for producing a 
pharmaceutical agent, by identifying a small molecule inhibitor of a cellular RNA. 
The inhibitor induces formation of a complex with a cellular protein and the 
20 inhibitor includes a first moiety and a second moiety and a linker joining the first 
moiety and the second moiety. The method involves identifying a first moiety 
which binds to a selected cellular protein or a portion thereof; and identifying a 
second moiety which binds to a target cellular RNA or a fragment thereof, and 
synthesizing the small molecule inhibitor in an amount sufficient to provide a 
25 therapeutic response when administered to a patient. The identification can also 
involve selecting a linker which allows advantageous complex formation. 

In preferred embodiments, the inhibitor is preferentially active on an allelic 
form of an mRNA in which at least one sequence variance provides a difference in 
RNA secondary structure. 

30 Further preferred embodiments include embodiments as specified in the first 

and/or second aspects. 
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In the methods of inhibiting an RNA and/or methods of treating a patient, a 
compound may be used in various ways, for example, the compound may be used 
alone, or may be used as part of a pharmaceutical composition. 

In a ninth aspect, the invention provides a complex-forming molecule which 
5 includes a first moiety which binds to a selected cellular protein or a portion thereof; 
a second moiety which binds to a target cellular RNA or a fragment thereof, and a 
linker connecting said first moiety and said second moiety. The small molecule 
inhibitor can bind to the selected cellular protein and to the target cellular RNA to 
form a complex. 

1 0 In preferred embodiments the molecule will inhibit expression of a target 

mRNA under translation conditions. 

"Translation conditions" refers to conditions such that translation would 

occur from a particular selected mRNA in the absence of an inhibitor of translation. 

Thus, such conditions can include where an in vitro translation system is present as 
15 well as in vivo conditions. 

In preferred embodiments, the selected cellular protein is an RNA binding 
protein or a derivative thereof. 

Further preferred embodiments are as described for the inhibitors in 
pharmaceutical compositions herein and/or has components, properties, targets or 
binding pairs as otherwise described herein. 

By "comprising" is meant including, but not limited to, whatever follows the 
word "comprising". Thus, use of the term "comprising" indicates that the hsted 
elements are required or mandatory, but that other elements are optional and may or 
may not be present. By "consisting of* is meant including, and limited to, whatever 
follows the phrase "consisting of\ Thus, the phrase "consisting of indicates that 
the listed elements are required or mandatory, and that no other elements may be 
present. By "consisting essentially of is meant including any elements listed after 
the phrase, and limited to other elements that do not interfere with or contribute to 
the activity or action specified in the disclosure for the listed elements. Thus, the 
phrase "consisting essentially of indicates that the listed elements are required or 
mandatory, but that other elements are optional and may or may not be present 
depending upon whether or not they affect the activity or action of the listed 
elements. 
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Other features, advantages and embodiments will be apparent from the 
following Detailed Description of the Preferred Embodiments and from the Claims. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1, Nuclease SI mapping of allelic mRNA structures. -SI, control 
reaction without SI added; +S1, Partial digestion by SI at ST'^C with increasing SI 
concentration; + OH', alkaline hydrolysis reaction; +T1(A), partial digesion by Tl at 
SS^'C to create a G-ladder to allow determination of cleavage sites; -T1(A), control 
10 reaction without Tl added, a. Mapping of accessible single-stranded regions of the 
U- and C-alleles of a 1 13-mer fragment (nucleotides 951-1063) of AARS mRNA. b. 
Mapping of accessible single-stranded regions of the U- and C-alleles of a 544-mer 
fragment (nucleotides 1378-1921) of RPA70 mRNA. Polymorphic positions are 
indicated in boldface and labeled SNP. 

15 

Fig. 2. Proposed secondary structural models of the polymorphic regions of 
the pairs of mRNA alleles. The polymoiphic bases are shown boldfaced and labeled 
SNP. The model building was assisted by the use of computer program MFOLD (M. 
Zuker, Washington University, St. Louis, MO). Only modeled regions supported by 

20 structural mapping data are shown, a. Proposed allelic structxu-es of the polymorphic 
region of AARS mRNA. The models are consistent with the data on structural 
mapping by nuclease S 1 and RNase Tl . The black bar highlights the single-stranded 
region around nucleotide 999 of the U-allele. b. Proposed allelic structures of the 
polymorphic region of RPA70 mRNA. The models are consistent with the data on 

25 structural mapping by nuclease S 1 . The black bar highlights the single-stranded 

■ 

region around nucleotide 1656 of the U-allele. 

Fig. 3. E. coli RNase H digestion of both alleles of the 544-mer fragment of 
the RPA70-1674U/C mRNA. a. Nucleotide sequence of the PS-oligos used in the 
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assay. Oligos 1 and 2 target the polymorphic site; the bases opposing the 
polymorphism in the mRNA are shown in boldface. Oligos 3 to 6 target the site of 
different allelic structures around nucleotide 1656. The terminal residues of oligos 1, 
2, 3 and 6 are labeled according to the positions of the complementary nucleotides in 
5 RPA70 mRNA. Oligos 4 and S share the same target hybridization sequence as 3, 
but with additional nucleotides (shown underlined) added to facilitate folding into 
hairpin structures for intended enhancement in allele discrimination, b, E. colt 
RNase H digestion pattems of the 5'-^^P-labeled U- and C-alleles hybridized to the 
oligos in a. Cleavage sites were confirmed by rurming on the same gel RNase Tl 
10 digestion ladders for the two alleles (not shown): they are centered at position 1674 
for oligos 1 and 2, 1650 for oligos 3-5, and 1654 for oligo 6. c. Comparison of 
percent cleavages of the U- and C-alleles calculated jfem b. Oligo-dependent- 
cleavage bands were integrated as a percentage of total RNA in each lane with the 
overlapping background-cleavage bands subtracted. Variations in sample loading to 
1 5 each lane were internally corrected by measuring percent cleavage instead of 
absolute band intensity. 

Fig. 4. Northem blot analysis of oligonucleotide-directed cleavage of 
endogenous RPA70 mRNA in human cell extracts. The extracts were prepared from 
HeLa and Calu-1 cells that express only the 1674U- or the 1674C-allele of the 
RPA70 mRNA, respectively. In addition to PS-oligos 1, 2, 3 and 6 as described in 
Fig. 3, oligo 7 (5* TGGTCTGCAGTTAGGGTCAG 3') was used in the assay as a 
non-allele-specific control oligo that targets nucleotides 2230-2249 of the 3*-UTR of 
RPA70 mRNA. a. Targeting the 1674U/C polymorphic site by oligos 1 and 2. b. 
Targeting the site of different allelic structures around nucleotide 1656 by oligos 3 
and 6. Reactions were done in duplicates. The averaged data points of the duplicates 
are plotted with standard deviations as error bars, and are shown below the 
corresponding Northem blots. Intact RPA70 mRNA levels were normalized to 
GAPDH levels and expressed as a percentage of no-oligo control for each 
experiment. 
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Fig. SA and 5B show the general structures of certain species which can be 
used as linkers in the present invention. 

5 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The development of therapeutic compounds has been and for the foreseeable 
future will continue to be a costly and time consuming process with only a modest 
rate of success. The vast majority of current drugs, as well as the majority of drugs 

10 in development are small molecules directed to protein targets. Small molecules 
have a number of attractive pharmaceutical properties compared to other classes of 
therapeutic compounds, including high potency, oral availability, effective access to 
most cellular compartments, economical manufacturing and formulation costs and 
stability. However, some limitations of small molecule drug development are that 

15 (1) current technology is best suited to protein targets, despite the fact that in some 
diseases the most desirable therapeutic end may be total elimination of a particulsu* 
protein, (2) some protein targets are not amenable to small molecule modulation, 
often because they lack the hydrophobic pockets characteristic of good small 
molecule binding sites, (3) cross reaction of small molecule compounds with cellular 

20 macromolecules other than the target can lead to unintended, and often clinically 
unacceptable, effects. 

Methods for developing inhibitors of specific RNAs would be of great use in 
addressing the above limitations of small molecule therapeutics. This fact has not 
escaped notice; there have been a variety of attempts to develop therapeutic 

25 compounds that target RNA. Most of the effort has focused on non-small molecule 
compounds such as antisense oligonucleotides, peptide nucleic acid, ribozymes, 
DNAzymes and similar nucleic acid based compounds. These polymeric 
compounds do not have most of the advantageous pharmaceutical properties of 
small molecules. There are some preliminary reports of small molecule interactions 

30 with mammalian mRNAs, however these interactions are characterized by 
micromolar to millimolar affinities and do not appear generalizable. 

As the molecular genetic description of disease advances, the level of 
specificity desired from therapeutic compounds increases as well. Current methods 
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do not provide for the level of specific gene or allele inhibition required to treat 
many diseases. The limit of the discrimination problem is the need to distinguish 
allelic forms of RNAs or proteins that may differ by as little as one nucleotide or one 
amino acid. The value of being able to discriminate allelic differences is evident 
when one considers, for example, autosomal dominant diseases or diseases of excess 
gene dosage, where elimination of one mRNA (but not both) would have a 
therapeutic effect. There are no effective pharmaceutical methods for discriminating 
such closely related molecules. 

A good example of a disease in which discrimination of allelic differences 
would be very helpful is Huntington's Disease, an autosomal dominant neurological 
disorder in which the presence of one copy of a Huntinton's allele (with an 
expanded triplet repeat) confers a gain of function phenotype associated with 
neurological dysfunction and degeneration. A variety of human and animal data 
support the view that elimination of the abnormal protein would eliminate the 
disease. However, once a protein is synthesized it is not clear that a small molecule 
can efTiciently induce its prompt destruction. Therefore a more practical route is 
elimination of the RNA encoding the abnormal allele, thereby preventing synthesis 
of the abnormal protein. Prevention of such synthesis is advantageously performed 
at the RNA level, e.g., by promoting rapid destruction of the RNA, by preventing 
processing of pre-mRNA to mature mRNA, preventing transport of mRNA from 
nucleus to cytoplasm, or preventing translation of mRNA to form protein. 

A nimiber of additional exemplary autosomal dominant diseases are shown 
in the following table. The information is also obtainable from other sources, for 
example at the electronic address shown below the table. 

TABLE 1 



Table of Autosomal Dominant Diseases 



OMIM 
Number' 


Gene 


118100 


CERVICAL VERTEBRAL FUSION 


148210 


KERATITIS-ICHTHYOSIS-DEAFNESS SYNDROME 


601227 


PROGRESSIVE EXTERNAL OPHTHALMOPLEGIA, TYPE 3 


601226 


PROGRESSIVE EXTERNAL OPHTHALMOPLEGIA, TYPE 2 


600363 


SPASTIC PARAPLEGIA 6 


157640 


PROGRESSIVE EXTERNAL OPHTHALMOPLEGIA 
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173100 


ri 1 ui 1 /\iv I u w AJvrioivi jjujd 1 loULA 1 JbD CjKUWTH HORMONE 
DEFiriFNCY 


179800 


DISTAI RFNAT TIJRTIT AR APmn^vTQ 


193100 


VITAMIN D-RF^sI^TANT RTPK'FT^S 


181350 


SCAPin nil lOPFPONIFAI ATPOPHV WTTU n i^j>Twr\o ktxj^t 


182601 


SPA^TTP PAP APT FHT A A 


1 82600 


QpA^Tir* PAP APT FHTA "X 


1 66600 


TVPF IT nQTFnPPTPO^TQ 


1467S0 


TPFTTTTVnQTQ T AA/TT^T TAP 


601544 




137600 


IRIDOGONIODYSGENESIS, TYPE 2 


1 OH JUv 




1S6S80 


MICROCEPHALY 


iZo iUU 


1 UKolON D Yo 1 ONIA 1 




HiiKbDl 1 AR Y MULTI-INFARCT TYPE DEMENTIA 


1 77fi<n 

1 / /ODU 


rJjJbUUUAAN liiUiVlA ELASTICUM 


1 OoOUI 


AUi OSOMAL DOMINANT LEWY BODY IN PARKINSON DISEASE 


OUUIUI 


DEAFNESS, AUTOSOMAL DOMINANT NONSYNDROMIC 

QT7XTCOPTXrCT TD A T O 
OIIIN0UKJJNE.UKAL Z 




H Y rtiK 1 bL.UKi5>M W 1 1 ti ESOPHAGEAL ABNORMALITY AND 

W VPOQP A T^T A Q 




Mlv^KUUliJrJtlALrY Wl 1 H UHUKiORETINOPATHY 




PiT A lV>f r^XTT* "DT A/^T/'I?AXT AXTCAiTTA 

JJlAIVlLirsil->-l5j-fAUJ\xAJN ANJbMlA 






129490 


ECTODERMAL DYSPLASIA 3 


OUl jHj 


NUlNoYNJJKUMlC otNoORlNEURAL 8 DEAFNESS 






104500 


HYPOPLASTIC LOCAL AMELOGENESIS IMPERFECTA 2 


600666 


POLYCYSTIC KIDNEY DISEASE 3 


164100 


NYSTAGMUS 2, CONGENITAL 


130050 


EHLERS-DANLOS SYNDROME. TYPE IV 


550000 


MITOCHONDRIAL DNA BREAKAGE SYNDROME, SECONDARY 
TO NUCLEAR MUTATION 


601251 


RETINAL CONE DYSTROPHY 2 




OMIM 
Number 


Gene 


169500 


AUTOSOMAL DOMINANT OR LATE-ONSET TYPE PELIZAEUS- 
MERZBACHER DISEASE 


134610 


FAMILL\L MEDITERRANEAN FEVER-LIKE SYNDROME WITH 

AMYLOIDOSIS 


193220 


VITREORETINOCHOROIDOPATHY 


601842 


AUTOSOMAL DOMINANT NONSYNDROMIC SENSORINEURAL 12 
DEAFNESS 


142680 


FAMILIAL AUTOSOMAL DOMINANT PERIODIC FEVER 


600994 


AUTOSOMAL DOMINANT NONSYNDROMIC SENSORINEURAL 5 
DEAFNESS 


177735 


PSEUDOHYPOALDOSTERONISM. TYPE I 
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601369 


DEAFNESS, AUTOSOMAL DOMINANT NONSYNDROMIC 


161800 


NEM ALINE MYOPATHY 1 


1 74000 


MEDULLARY CYSTIC KIDNEY DISEASE 1 


600223 


SPINOCEREBELLAR ATAXIA 4 


603563 


SPASTIC PARAPLEGIA 8 


144750 


HYPEROSTOSIS CORTICALIS GENERALISATA, BENIGN FORM OF 

WORTH, WITH TORUS 

PALATINUS 


600165 


NANOPHTHALMOS 1 



http//www.ncbi.nlm.nih.gov/Omim/ 



A second set of diseases in which allele specific inhibition would provide a 
5 therapeutic benefit includes cancer and other diseases with LOH. In these diseases 
there is a genetic difference between normal somatic tissues and disease tissues, and 
exploitation of that difference will provide a differential harmful effect on disease 
tissue compared to normal issue. The inhibitor molecules provided by this invention 
can also be used in such areas, as well as other applications where an inhibitor of an 
10 RNA would be beneficial, especially where an allele-specific inhibitor would be 
advantageous. 

In accordance with the above description, as small molecules often have 
superior pharmacologic properties to polynucleotide therapeutics, there exists 
substantial interest in discovering ways to target specific RNA sequences and 
15 structures using small molecules. In a few limited examples, naturally occurring, 
antibiotics have been found to act by binding the RNA component of 
ribonucleoprotein complexes, e.g., aminoaglycoside antibiotics. (Fourmy et al. 

(1996) Science 274:1367-1371) As such molecules cause global rather than gene- 
specific suppression of gene expression, their use has been limited ahnost 

20 exclusively to antibacterial and antifungal therapy. A more powerful and broad 
application of RNA targeting would entail the discovery of small molecules that 
bind specifically to particular mRNAs or pre-mRNAs and modulate their function 
by, for example, inhibiting or activating translation or splicing. This is especially 
challenging because proteins are believed to displace all but the most strongly bound 

25 ligands from mRNA , and few if any small molecules are known to bind mRNA 
with an equilibrium dissociation constant (Kd) below 100 nM (Chow & Bogdan 

(1997) Chem. Rev. 97:1489-1513). Nonetheless, at least one example exists to 
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demonstrate that even micomolar RNA binders can specifically inhibit the 
expression of niRNA in cells (Mei et al. (1998) Biochemistry 37:14204-14212). 
Additionally, expression of a mRNA containing an engineered Hoechst dye-binding 
aptamer was found to be suppressed by addition of millimolar concentrations of dye 
5 to mammalian cells containing the engineered mRNA (Werstuck & Green (1998) 
Science 282:296-298). Notwithstanding these examples, it is widely recognized that 
RNA represents an especially difficult species to target using small molecules, as 
judged by numerous failures to discover legitimate leads in high-throughput 
screening efforts conducted throughout the pharmaceutical industry over the past 
10 decade. The difficulties in targeting RNA with small molecules most likely arise 
fh)m the fact that highly structured RNAs present few hydrophobic pockets; in 
proteins, such pockets are almost without exception the binding sites for small 
molecules. 

The current understanding of small molecule binding to macromolecules is 
15 largely shaped by the study of small molecule-protein interactions. The 

characteristics of such interactions include: (1) much of the binding energy is 
associated with the exclusion of water by hydrophobic contacts, typically involving 
an invagination of the protein surface (hydrophobic binding pocket) into which the 
small molecule ligand fits; (2) The binding often entails some change in the 
20 orientation of the amino acids comprising the pocket (induced fit); (3) charge 

interactions are generally not a component of high affmity protein - small molecule 
interactions. 

These properties of small molecule-^jrotein interactions indicate that RNA 
are targetable by small molecules as well because RNA shares many of the 

25 properties of proteins that make them attractive for binding small molecules. First, 
RNA fi^equently adopts a shape similar to proteins that bind small molecules. 
Second, the surfaces of RNAs tend to present the sugars and bases, thereby 
facilitating interactions of these groups with small molecules. Such interactions can 
include hydrogen bonding, van der Waals interactions, and electrostatic interactions, 

30 as well as hydrophobic interactions in certain orientations. Third, like proteins, 
RNA structures are plastic, which allows them to mold to the shape of a ligand 
(induced fit). Fourth, the sequence complexity of RNA, and particularly the 
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complexity of higher order RNA structure, creates unique three dimensional 
structures in specific RNAs that are targetable with high specificity. 

Despite these attractive properties of RNAs, the development of small 
molecule inhibitors of RNA has not advanced beyond the identification of some 
5 natural products (aminoglycoside antibiotics) which bind prokaryotic ribosomal 
RNA and inhibit protein translation. The reasons for this include: the hydrophobic 
surfaces of RNA exist amidst a high concentration of polar (edges of bases, 2'-0H 
groups of sugars) and charged (phosphate moieties) groups which are repulsive to 
hydrophobic moieties. Thus it is not surprising that most small molecules that bind 
10 RNA have charged functionality in addition to hydrophobic surfaces. These are the 
characteristics of aminoglycoside antibiotics, for example. The charge on such 
molecules impedes cell membrane penetration and generally confers upon such 
molecules poor pharmacological properties (e.g. poor intestinal absorbtion, poor 
binding to serum proteins that act as carriers). 
15 An important aspect of the present invention is the realization that the 

formation of a trimolecular complex between an RNA, a protein and a small 
molecule can address some of the limitations of small molecule - RNA interactions. 
In particular, favorable electrostatic contacts between a protein and an RNA can 
dissipate charge, thereby creating an improved local environment for hydrophobic 
interactions between the RNA binding moiety of a small molecule and its target 
RNA. There is also a cooperative quality to the binding interactions between (1) 
RNA and protein, and (2) the small molecules of this invention, which link an RNA 
and a protein via interactions with two moieties of the small molecule. 

The favorable electrostatic interactions between protein and RNA which are 
an aspect of this invention will depend on the specific chemical characteristics of the 
target RNA and the target protein. Such interactions are understood by those skilled 
in the art. It is worth noting, however, that most proteins have a preponderance of 
polar and charged residues on their surfaces, and proportionately fewer hydrophobic 
groups. Further, many proteins have patches (in three dimensional space) of basic 
residues in the vicinity of their ligand binding sites. Such proteins would constitute 
one class of preferred protein partners for the small molecules of this invention. 

Thus, the description presented here provides a way to overcome the weak 
interactions that are characteristic between small molecules and RNA, and can 
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provide a mechanism for small molecules to bind to and alter the biology of a 
specifically targeted RNA. As indicated above, in a general aspect, this invention 
includes a RNA target, a cellular protein, and a small molecule that simultaneously 
interacts with both the RNA and the protein bringing them into close proximity. An 
5 important aspect of the invention is that the protein and the RNA, when in close 

proximity, also interact, but only when brought together through the intemiediacy of 
the small molecule, hi other words, the RNA and the protein do not appreciably 
interact in the absence of the small molecule. The resulting trimolecular complex 
(protein-small molecule-RNA), initiated by the small molecule, provides a level of 

10 stability that would be difficult to achieve with the small molecule alone, because 
the ensuing protein-RNA contacts stabilize the overall complex. Harnessing 
"additional" RNA binding affinity by recruitment of cellular protein, can result in a 
binding complex with increased affinity for the targeted RNA. The result of 
complex formation can be to prevent translation, to shorten mRNA half-life, or to 

1 5 induce mRNA cleavage, thereby suppressing expression of the targeted gene. 

Thus, the close juxtaposition of protein and RNA would drive RNA-protein 
interactions that would not otherwise occur. Therefore, the protein that is recruited 
by the small molecule to form the RNA binding complex need not necessarily be 
limited to RNA binding proteins. Proximity can also induce RNA-protein 

20 interactions even involving proteins that are not known to bind RNA with high 
affinity or high specificity. The interactions with RNA may occur by exploiting 
amino acids normally involved in protein-protein interactions or basic residues that 
are positioned close to the RNA. The resulting trimeric complex would have 
specificity (due to the small molecule) and high affinity (due to multiple synergistic 

25 binding interactions with RNA of both the small molecule and protein). Proteins 
that would be advantageous for use in this invention include: 

Abundant proteins (so that protein constituent of complex would not be 
limiting) 

Ubiquitously expressed proteins (so that method can be applied to all target 
30 tissues) 

Proteins with known small molecule ligands (simpHfies method and likely 

provides increased affinity of the protein-RNA interaction component 

of the trimer complex) 
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DNA binding proteins 

Ribonucleases (Rnases) (shorten half-life of targeted RNA, suppressing 

translation and gene expression, does not rely on steric interactions to 
prevent translation) 
5 Proteins that bind cell-permeable small molecule ligands 

A method to enhance the bioactivity of RNA-binding small molecules is 
described. This method is predicated on: (1) identification of a cellular protein with 
favorable characteristics to interact with RNA if brought in close proximity; (2) 
synthesis of a bipartite small molecule possessing an invariant ligand on one end that 
10 interacts with the identified protein and a variable structure (e.g., generated by 
combinatorial chemistry) on the other end; and (3) an assay able to detect specific 
binding of the protein-small molecule composite surface to the targeted RNA. 

General techniques for regulating the proximity and orientation of proteins in 
cells have been developed (Schreiber & Crabtree (1996) Trends Biochem. Sci. 
15 2 1 :4 1 8-422) and are relevevant and useful for this invention. Small molecule 

chemical inducers of dimerization (CIDs), which bind two proteins simultaneously, 
have been used to modulate the activity of several different cellular protein 
interactions (developed (Schreiber & Crabtree (1996) Trends Biochem. Sci. 21 :418- 
422; Spencer et al. (1993) Science 262:1019-1024; Pruschy et ah (1994) Chem. 
20 Biol. 1 :163-172; Spencer et al, (1996) Curr. Biol. 6:839-847; Belshaw e tal. (1996) 
Chem. Biol. 3:731-738; Spencer et al. (1995) Proc. Nat. Acad. Sci. USA 92:9805- 
9809; Holsinger e al. (1995) Proc. Nat. Acad. Sci. USA 92:9810-9814; Luo et al. 
(1996) Nature (London) 383:181-185; Farrar et al. (1996) Nature (London) 383:178- 
181; Belshaw et al. (1996) Proc. Nat. Acad. Sci. USA 93:4604-4607; Harding et al. 
25 ( 1 989) Nature (London) 34 1 :758-760) and show the feasibility of enlisting non- 
physiological interactions to effect physiological changes. 

One such CID, rapamycin, binds both FK506 binding protein (FKBP12) and 
FKB12-rafamycin-associated protein (FRAP) simultaneously, creating a trimeric 
complex. Complex formation inhibits the kinase activity of FRAP, resulting in cell 
30 cycle arrest at Gi. Interestingly, it is the composite surface, with contributions from 
both rapamycin and FKBP12 which mediates the interaction of the complex with 
FRAP. The X-ray crystal structure for this trimeric complex has been solved (Choi 
et al. (1996) Science 273:239-242). The structure shows that the rapamycin is a 
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bipartite molecule interacting with a hydrophobic pocket on each protein. Each end 
of rapamycin fits into each protein allowing alignment and protein-protein 
interaction to the two bound proteins to occur. Although rapamycin interacts 
extensively with both protein partners, the extent of protein interaction is limited. 
5 The structure suggests that the interaction does not require extensive protein 

interactions, because only 3 H-bond contacts and only 400 of solvent-accessible 
surface area participate in the interaction between FRBP12 and FRAP. 

The present invention involves targeting a mRNA sequence, a mRNA 
variance, or a structural variation due to a nucleotide variance with a small molecule. 
10 This can be achieved using a relatively low affinity RNA-binding small molecule 
whose binding is augmented by being presented to the RNA as a complex with a 
protein. To form high affinity complexes with RNA, usually a bipartite small 
molecule is designed to include an invariant protein-specific ligand on one end and 
an element of a combinatorial small molecule library on the other end, connected by 
15 a linker. The bipartite small molecule will bind to its protein and create a composite 
binding surface that would interact with RNA. With binding moieties and linker 
properly selected, "protein highjacking" by the small molecule will induce the 
formation of protein-RNA bonds that would otherwise not occur, resulting in high 
affinity binding to the target RNA. The stability of the complex (mRNA, small 
20 molecule, and protein) will be higher than the binding of either of the components to 
the target mRNA individually and could alter the biology of the target mRNA. 

In a specific application, a protein-small molecule complex is created by 
binding of a specific protein to the invariant specific ligand on one end of the 
bipartite small molecule. Preferably this protein would bind the small molecule in a 
hydrophobic pocket that was surrounded with basic residues favoring interaction 
with nucleic acid. The binding would result in a composite binding surface whose 
characteristics would be contributed by the local amino acid residues and the 
particular chemical entity presented by the variable end of the small molecule. The 
dimeric complex would then be assayed for RNA binding to identify potential 
chemical inducers of protein-RNA dimerization. By analogy to the system 
described for CIDs, the juxtaposition of both macromolecules by the small molecule 
will induce RNA-protein interactions resulting in a high affinity trimolecular 
complex. High affinity complex formation can result in an alteration of the biology 
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of the mRNA and suppression of gene expression. This can occur in at least two 
ways: 

1) If the complex were to form in the 5' UTR, 3* UTR, splicing junctions of 

mRNA, or the coding region of the mRNA, the stabihty of the 
5 complex could inhibit maturation, processing, or translation of the 

mRNA. 

2) If the "highjacked" RNA binding protein also contains a nuclease activity 

(e.g., RnaseL), then the trimerization could direct specific cleavage of 
the target mRNA. 

10 RnaseL represent one potential candidate nuclease whose binding could be 

strengthened in the presence of the appropriate small molecule. RnaseL is unique 
among nucleases in that its activity is ligand dependent. A ligand binding domain 
has been identified in the amino terminal region of this nuclease which, when bound 
with ligand, activates the nuclease activity of the protein. This domain and its 
15 characterized ligand, 2 '-5' oligoadenylate (2-5 A), present a blueprint upon which to 
model a bipartite small molecule that would bind to RnaseL, both activating its 
nuclease activity and targeting it to a specific RNA sumultaneously. For this 
application, a specific bipartite small molecule would be synthesized with a 2-5A 
ligand mimic at one end and elements of a combinatorial library at the other end. 
20 The action of the modified small molecule would take place in two stages: 1) 

recognition of and binding to RnaseL; 2) recriutment and local activation of RnaseL 
to ablate the target mRNA. 

Other RNases could also be used, with or without engineering of ligand- 
dependent nuclease activity, as well as other RNA binding proteins. Such proteins 
have the advantage that the structure is known to be suitable for forming a stable 
complex with RNA, thus it is very probable that that stability can be utilized in 
forming a specific protein-small molecule-RNA complex. The structure can be 
modified as desired to provide a suitably low level of non-specific RNA binding. 
Those skilled in the art are famiUar with the modification and mutation methods 
useful for modifying protein structure and evaluation of the resulting RNA binding 
properties, and further with how to provide a suitable balance between RNA binding 
in the absence of the small molecule and stabilization of the complex. 
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Those skilled in the art also recognize that identification of binding small 
molecules, and the particular binding moieties, which can simultaneously bind to a 
selected protein and a target RNA can proceed in various ways, all of which are 
within the scope of this invention. While it is preferred that screening for binding 
5 occur with small molecules which contain a first and second potential binding 

moieties and a linker joining the two, a binding molecule can also be constructed by 
building up fi-om an initial binding molecule. Thus, for example, a small molecule 
can be identified (e.g., by screening or by identification of a known ligand or 
binding compound) which binds to the selected protein. This molecule can be used 
10 as the first moiety and attached to a variety of linkers and/or attached to the linker 
with various chemistries or attachment locations. The combined first moiety and 
linker molecules can then be screened to identify those which retain binding 
capability. Those which retain binding can then be attached to potential RNA 
binding moieties and screened for binding to the target RNA, either in the presence 

1 5 or absence of bound protein. Similarly, the build-up could proceed fi-om the RNA 
binding moiety and/or could begin with one of the moieties attached to Unker(s). In 
any of the approaches, it can be beneficial to vary the linker (e.g., different linker 
molecules and/or different length linkers) to identify molecules with better 
characteristics (e.g., tighter binding). 

20 Additional information on the construction of small molecule libraries and 

the selection of the first and second moieties and linkers of the described complex- 
forming compounds follows. 
Protein Binding Moietv - Moietv 1 

In one embodiment of the invention the selection of a protein binding moiety 

25 (moiety 1) is achieved by a screening process. The aim of the screening process 
may be to identify a ligand for a protein that does not have a known small molecule 
ligand, or to optimize a known ligand. For example, the protein RNAse L has a 
known ligand, 2*-5' oligoadenylate (2-5A), however it may be desirable to identify a 
smaller, or less charged molecule that interacts with the 2-5 A binding site of RNAse 

30 L. Such a molecule can be identified by a screening assay in which RNAse L is 
fixed in the well of a microtiter plate and tested for binding (and, optionally, 
activation of RNAse activity) by a series of 2-5A analogs. One skilled in the art will 
recognize that nucleotide analogs, or oligomerized nucleotide analogs (with 
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modification of the phosphodiester bond, the sugar, the base, or some combination 
of these groups) would be a useful starting point for such a library, however other 
approaches are also useful, such as non-nucleotide structures which replicate some 
of the structural elements of 2'«5' polynucleotides. Alternatively, a synthetic or 
5 natural product library can be screened without respect to the structure of the natural 
ligand. 

In general the composition of a library from which to select a binding moiety 
1 should be guided by the principles which inform design of small molecule libraries 
for protein targets. Guidance on the construction of such libraries is provided in a 
10 number of recent texts including the following. Terrett, N. K, and Terret, N., (1998) 
Combinatorial Chemistrv (Oxford Chemistry Masters) Oxford Univ Press, ISBN: 
0198502192. Terrett, Nicholas K. (1998) Combinatorial Chemistrv (Oxford 
Chemistry Masters, 2), Oxford Univ Press, ISBN: 0198502206. Wilson, Stephen R. 
(Editor), and Anthony W. Czamik (Contributor) (1997) Combinatorial Chemistrv : 

1 5 Svnthesis a nd Application. John Wiley & Sons, ISBN: 047 1 12687X. Bunin, Barry 
A. (Editor) (1998) The Combinatorial Index, Academic Press, ISBN: 0121413403. 
Obrecht, Daniel and Jose M. Villalgordo (1998) SoUd-Supported Combinatorial and 
Parallel Svnthesis of Small-Molecular- Weight Compound Libraries (Tetrahedron 
Organic Chemistry Series, V. 17), Pergamon Press, ISBN: 0080432581. 

20 In another aspect of the invention the protein binding moiety is a known 

small molecule ligand of a cellular protein or an analog of such a ligand. Examples 
of specific small molecules which may be useful for moiety 1 include: adenosine 
triphosphate analogs, which could sequester one or more cellular ATPases; flavin 
adenine dinucleotide or FAD analogs; nicotinamide or nicotinamide analogs; folates 

25 or folate analogs, which could sequester proteins such as dihydrofolate reductase, 
thymidylate synthase or other abundant proteins which require folate as a cofactor; 
cyclosporin, FK506, rapamycin, and coumermycin or their synthetic analogs, which 
bind to FK506 Binding Protein (FKBP). Dimerization schemes based on the latter 
four natural products have been described. Recently totally synthetic, cell- 

30 permeable analogs of these componds, with superior phannacological properties, 

have been produced. (Amara, J.F., Clackson, T., Rivera, V.M., Guo, T., Keenan, T., 
Natesan, S., Pollock, R., Yang, W., Courage, N.L., Holt, D.A. and M. Oilman 
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(1997) Proc. Natl. Acad, ScL U. 5. A, 94(20): 106 18-23.) The compounds of Amara 
et al. are also useful compounds for the present invention. 

Another class of known ligands of cellular proteins are ligands for nuclear 
proteins. Such ligands have the advantage of providing access to pre-mRNA, which 
5 is confined to the nucleus. Pre-mRNA, which includes introns of intron-containing 
geneSi is often larger than processed mRNA, and therefore contains more potentially 
targetable structures. Ligands of nuclear proteins include nucleotide analogs which 
bind to specific DNA repair proteins. Examples of such ligands include 5' 
fluorocytidine, which binds to DNA methyltransferase; ^O-benzylguanine, which 
10 binds methylguanine DNA methyltransferase; and alkylated ethenoguanine, which 
binds to alkyladenine DNA glycosylase. 

Another class of hgands which bind cellular proteins are amino acid analogs 
which bind proteins involved in amino acid metabolism or protein translation. 
Examples of such molecules include borrelidin, which binds histidyl tRNA 

15 synthetase, and histidinol, which binds histidyl tRNA synthetase, as well as the L- 
glutamine antagonists acivicin, DON and azaserine; the aspartic acid analogs PALA 
and L-alanosine, as well as compounds such as buthionine sulfoximine and 
difluoromethylomithine. 

Any of the enumerated ligands, as well as others, could serve as the starting 

20 point for generating a library with the composition: moiety 1 (fixed ligand) + moiety 
2 (library), or the composition: moiety 1 (fixed ligand) -i- linker (fixed or library) + 
moiety 2 (library). Alternatively any of the ligands mentioned could be used as the 
basis for constructing a library of limited diversity in order to identify a more 
advantageous ligand for the selected cellular protein. 

25 In another aspect of the invention a preferred size for moiety 1 is less than 

1,500 Da, a still more preferred size is less than 1,200 Da, a still more preferred size 
is less than 900 Da and a most preferred size is less than 600 Da. 
RNA Binding Moietv - Moietv 2 

The RNA binding moiety is in general any moiety with desirable chemical 

30 and pharmaceutical properties which binds to a specific target RNA with high 
affinity. Such a moiety may be identified fi*om a combinatorial small molecule 
library or a natural product library by a screening assay of the type described below. 
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A preferred moiety 2 is a molecule which can participate in hydrophobic interactions 
with RNA. 

In another aspect of the invention a preferred size for moiety 2 is less than 
1 ,500 Da, a still more preferred size is less than 1,200 Da, a still more preferred size 
5 is less than 900 Da and a most preferred size is less than 600 Da. 
Linker Moietv 

As indicated above, a linker moiety is optional. However, in specific 
instances the linker moiety will be an essential element of the small molecule 
library, and in particular may be important for bringing the target mRNA and protein 
10 together in optimal orientation and at an optimal distance. There may be one, two or 
more linkers joining moieties 1 and 2, depending on the structure of moieties 1 and 2 
and the optimal geometry for the trimolecular complex. In preferred embodiments 
there is one linker. Examples of useful linker chemistries are provided, without 
limitation, in Figure 5. 

15 Linker groups may be varied to achieve different orientations or different 

distances between moieties 1 and 2. For example, the linker group may include a 
planar aromatic ring system (see diagram) such as, for example, a benzene ring. 
Moieties 1 and 2 may be attached to the ring or rings in any of the possible 
orientations, some of which are indicated in the figure. Alternatively linkers may 

20 consist of hydrocarbon chains of various lengths and chemical compositions. 

Several exemplary hydrocarbon chains are shown in the figure, along with some of 
the possible points of attachment for moieties 1 and 2. Another possible linker 
moiety is a substituted sugar or polysaccharide. Exemplary sugars are shown in the 
figure, along with some possible sites for attaching moieties 1 and 2. Another 

25 possible linker moiety is a beta lactam or cepham ring. Examples of such rings are 
shown in the figure, along with some possible sites for attaching moieties 1 and 2. 
Yet another possible linker is an amino acid, an amino acid analog or a short chain 
of amino acid analogs. One skilled in the art will recognize that the exemplary 
linker structures provided are not comprehensive, and that additional variations on 

30 the structures shown, as well as the use of other organic chemical groups of similar 
size may be usefiil to achieve other orientations or distances between moieties 1 and 
2. 
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It may be usefixl to join certain moieties 1 and 2 at more than one site; . that 
is, it may be useful in some instances to use more than one linker. In such cases 
either both or all linkers may be of the same type, or one linker may of one type and 
the second or more linkers may be of another type or types. Linker types include not 
5 only those listed above but other structures that will be evident to one skilled in the 
art. 

Although the linker moiety is intended to join moieties 1 and 2, another 
aspect of this invention is the participation of the linker in contacts with either 
moiety 1 or moiety 2. For example, sugars are known to be capable of binding 
10 RNA. Thus, in some cases, contacts between a substituted sugar and either moieties 
1 or 2 may contribute to the affinity or stability of the trimolecular complex. 

In another aspect of the invention a preferred size for moiety 2 is less than 
700 Da, a still more preferred size is less than 500 Da, a still more preferred size is 
less than 400 Da and a most preferred size is less than 300 Da. 
15 Screening Assavs 

Screening the small molecule libraries of this invention can take place in 
several ways. For example, first, one can hold constant the composition of moiety 1 
and a linker and create a library which varies only at moiety 2. Alternatively just 
moiety 1 is fixed and the library is made by varying the linker and moiety 2. These 

20 same two approaches can be taken with moiety 1 and 2 reversed. Once a ligand is 
selected at one moiety, for example moiety 2, it may be useful to create a 
constrained library with moiety 2 and the linker fixed and moiety 1 only varying to a 
hmited degree, so as to explore the effects of the selected moiety 2 on the binding of 
moiety 1 (even if moiety 1 has been previously selected). Similar experiments can 

25 be performed in which both the linker and moiety 1 are varied. The same libraries 
and screens can be performed with moieties 1 and 2 reversed. 

In one aspect of the invention the linker is the preferred component of the 
library to vary having selected a moiety 1 and a moiety 2. The reason is that the 
apposition of protein and RNA surfaces in the optimal spatial relationship will be 

30 crucial to achieving the maximal possible binding affinity. This optimal relationship 
will entail maximizing attrative electrostatic interactions while allowing for maximal 
hydrophobic contacts by the protein. One skilled in the art will recognize that the 
permutations of linker composition and structure enimierated above are only a 
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fraction of the structures that could be used to bring moieties 1 and 2 into optimal 
alignment. 

Once a library is made it can be screened in several ways. First, a library can 
be screened for binding to its intended target. For example, a library in which moiety 
5 1 varies can be screened for binding to the target protein, or a library in which 
moiety 2 varies can be screened for binding to the target RNA. One convenient 
assay for such screens is the scintillation proximity assay (SPA). In this assay a 
radioactive isotope and a scintillant are brought into proximity by a specific binding 
interaction, allowing the radioemitter to excite the scintillant, which is then detected 

1 0 with sensitive detectors. This arrangement obviates the need to separate free from 
boimd components, and allows real time monitoring of the binding interaction. SPA 
can be carried out more rapidly than most other methods used to monitor binding 
reactions, and is also relatively inexpensive. The methodology 
also lends itself to automation. (For a review of the SPA see: Udenfiiend, S., 

15 Gerber, L, and Nelson, N. (1987) Scintillation proximity assay: a sensitive and 
continuous isotopic method for monitoring ligand/receptor and antigen/antibody 
interactions. Anal Biochem 1 6 1 (2):494-500. For an up to date review of automated 
high throughput screening see: Femandes, P.B. (1998) Technological advances in 
high-throughput screening. Cwrr Opin Chem Biol 2(5):597-603.) 

20 An alternative assay format is to screen a library in the presence of both the 

target RNA and the target protein. This allows the affinity of the final complex to be 
assessed. 

Another useful assay is to screen a library in the presence of both the target 
RNA and the target protein present in a cell extract which contains substantially all 
25 the other cellular constituents which might interact with the target RNA, the target 
protein or the small molecule. 

A final important assay is to screen a library in the presence of cells which 
express both the target RNA and the target protein, and where there is an assay for 
abrogation of RNA fiinction. 



30 



In addition to the provision of particularly advantageous molecules which 
can be used to inhibit an activity or activities of specific cellular RNA molecules, the 
invention is also concerned with the identification and use of targets within and 
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RNA sequence, which are particularly useful for allele specific inhibition of an 
activity of RNA corresponding to a particular gene. In general, these targets are 
related to changes in secondary structure of an RNA species which occur as a result 
of the presence of one or more sequence polymorphisms in an RNA (i.e., differences 
5 in secondary structure between allelic forms of an RNA, especially an mRNA or a 
pre-mRNA. Commonly, the sequence polymorphisms are Single Nucleotide 
Polymorphisms (SNPs). 

SNPs are single base-pair substitutions that occur within and outside genes 
(1-3). Due to the growing recognition that SNPs can be used as genetic markers for 
10 identifying genes that contribute to disease susceptibility or resistance (4, 5), or drug 
response (6, 7), SNP discovery has been markedly accelerated over the past two 
years. Within intragenic regions composed of introns and exons, the average 
frequency of SNP occurrence is approximately 1 in every 500 base pairs, with 
introns more highly variable than exons (2). Within exons, SNPs occur in both 

15 protein-coding regions and the 5'- or 3'-UTR regions, and those in coding regions 
do not always cause amino acid variations. 

Genes containing one or more SNPs can give rise to two or more allelic 
forms of mRNAs. SNPs associated with different allelic forms of mRNA may 
account for allelic variation in the interaction of mRNA with cellular components 

20 involved in its synthesis, maturation, transport, translation or degradation. It has 
been documented that a number of single base-pair substitutions alter or create 
essential sequence elements for splicing, processing or translation of human mRNA 
(1). These SNPs exhibit phenotypic consequences including altered length and/or 
steady-state level of cytoplasmic mRNA. On the other hand, SNPs that do not affect 

25 RNA consensus and protein sequences have not been analyzed in detail. It is 

conceivable that such SNPs could also lead to variant phenotypic effects, most likely 
through non-consensus-dependent mechanisms. 

It has been shown by a growing body of evidence that the folding of mRNA 
influences a diverse range of biological events such as mRNA splicing (8, 9) and 
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processing (10-12), and translational control (13-16) and regulation (17-19). Since 
the structure of mRNA is determined by its nucleotide sequence and its 
environment, we were interested to examine whether the folding of mRNA could be 
influenced by the presence of SNPs. In this report, we describe the analysis of two 
5 previously identified SNPs in coding regions (V. P. Stanton, Jr., unpublished 
results): a U/C transition at nucleotide 1013 of human alanyl tRNA synthetase 
(AARS, GenBank Accession # D32050) and a U/C transition at nucleotide 1674 of 
human replication protein A, 70 kDa subunit (RPA70, GenBank Accession # 
M63488). Both SNPs do not affect amino acid sequences of the respective proteins. 

10 An initial screen of lymphoblasts ftom 36 imrelated individuals (V. P. Stanton, Jr., 
unpublished results) revealed that heterozygotes that carry both the U- and the C- 
alleles and homozygotes that carry only one of the two alleles occur at frequencies 
of 57% and 43%, respectively, for AARS mRNA; and 31% and 69%, respectively, 
for RPA70 mRNA. Using structural mapping and structure-based targeting 

15 strategies, we show that the SNPs have profound effects on the structural folds of 
the mRNAs. Such findings of the contribution of common genetic variation to 
structural diversity of mRNA have not been described previously. These results 
suggest that phenotypic consequences of SNPs could arise from mechanisms that 
involve allele-specific structural motifs in mRNA. 

20 The targets identified as corresponding to sequence polymorphisms are, in 

particular, useful for targeting (including screening and inhibitory use) with the 
complex-forming molecules described herein. However, the use of those targets is 
not limited to targeting with such molecules, but rather can be targeted using any 
type of allele specific inhibitors (see e.g., Housman, supra). 

IS Likewise, the complex-forming molecules described herein are useful for 

targeting to the targets corresponding to RNA secondary structure, but their use is 
not limited to those targets. Instead, such molecules can be used for any target in an 
RNA for which specific binding molecules are developed. 
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Thus, the present invention provides targets and inhibitory molecules which 
are broadly applicable for treatment of diseases in which inhibition of a particular 
RNA would be beneficial, e.g., autosomal dominant diseases, proliferative disorders 
such as cancer, and diseases where reduction of expression of a gene would be 
5 helpful. 

As understood by those skilled in the art, inhibition of an RNA or treatment of a 
disease in a patient can be performed in many different ways, depending in part on 
the particular characteristics of the disease and of the molecule to be delivered or 
administered. Guidance on preparation and administration of therapeutic 
10 compounds and pharmaceutical compositions is provided in the following section. 

Preparation & Administration of Pharmaceutical Compositions 

For the treatment of patients suffering from a disease or other condition in 
which the inhibition or other modulation of an RNA is desired, the preferred method 
of preparation or administration will generally vary depending on the type of 
compound to be used. Thus, those skilled in the art will understand that 
administration methods as known in the art will also be appropriate for the 
compounds of this invention. 

The particular compound that exhibits activity on a target RNAcan be 
administered to a patient either by itself, or in pharmaceutical compositions where it 
is mixed with suitable carriers or excipient(s). In treating a patient exhibiting a 
disorder or condition of interest, a therapeutically effective amount of an agent or 
agents is administered. A therapeutically effective dose refers to that amount of the 
compound that results in amelioration of one or more symptoms or a prolongation of 
survival in a patient. 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures and/or experimental animals, 
e.g., for determining the LD50 (the dose lethal to 50% of the population) and the 
ED50 (the dose therapeutically effective in 50% of the population). The dose ratio 
between toxic and therapeutic effects is the therapeutic index and it can be expressed 
as the ratio LD50/ED50. Compounds which exhibit large therapeutic indices are 
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preferred. The data obtained from these cell culture assays and animal studies can 
be used in formulating a range of dosage for use in humans. The dosage of such 
compounds lies preferably within a range of circulating concentrations that include 
the ED50 with little or no toxicity. The dosage may vary within this range depending 
5 upon the dosage form employed and the route of administration utilized. 

For any compound used in the method of the invention, the therapeutically 
effective dose can be estimated initially from cell culture assays. Such information 
can be used to more accurately deteraiine useful doses in humans. Levels in plasma 
may be measured, for example, by HPLC or other means appropriate for detection of 

1 0 the particular compound. 

The exact formulation, route of administration and dosage can be chosen by 
the individual physician in view of the patient's condition (see e.g. Fingl et. al., in 
The Pharniacoloeical Basis of Therapeutics, 1975, Ch. 1 p.l). 

It should be noted that the attending physician would know how to and when 

15 to terminate, interrupt, or adjust administration due to toxicity, or to organ 
dysfimctions. Conversely, the attending physician would also know to adjust 
treatment to higher levels if the clinical response were not adequate (precluding 
toxicity). The magnitude of an administered dose in the management of the disorder 
of interest will vary with the severity of the condition to be treated and to the route 

20 of administration. The severity of the condition may, for example, be evaluated, in 
part, by standard prognostic evaluation methods. Further, the dose and perhaps dose 
frequency, will also vary according to the age, body weight, and response of the 
individual patient. A program comparable to that discussed above also may be used 
in veterinary medicine. 

25 Depending on the specific conditions being treated and the targeting method 

selected, such agents may be fomiulated and administered systemically or locally. 
Techniques for formulation and administration may be foxmd in Alfonso and 
Gennaro (1995). Suitable routes may include , for example, oral, rectal, 
transdermal, vaginal, transmucosal, or intestinal administration; parenteral delivery, 

30 including intramuscular, subcutaneous, or intramedullary injections, as well as 
intrathecal, intravenous, or intraperitoneal injections. 

For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
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Ringer's solution, or physiological saline buffer. For transmucosal administration, 
penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Use of pharmaceutically acceptable carriers to formulate the compounds 
5 herein disclosed for the practice of the invention into dosages suitable for systemic 
administration is within the scope of the invention. With proper choice of carrier 
and suitable manufacturing practice, the compositions of the present invention, in 
particular those formulated as solutions, may be administered parenterally, such as 
by intravenous injection. Appropriate compounds can be formulated readily using 

10 pharmaceutically acceptable carriers well known in the art into dosages suitable for 
oral administration. Such carriers enable the compounds of the invention to be 
formulated as tablets, pills, capsules, liquids, gels, syrups, slurries, suspensions and 
the like, for oral ingestion by a patient to be treated. 

Agents intended to be administered intracellularly may be administered using 

15 techniques well known to those of ordinary skill in the art. For example, such agents 
may be encapsulated into liposomes, then administered as described above. 
Liposomes are spherical Upid bilayers with aqueous interiors. All molecules present 
in an aqueous solution at the time of liposome formation are incorporated into the 
aqueous interior. The liposomal contents are both protected fh)m the external 

20 microenvironment and, because liposomes fuse with cell membranes, are efficiently 
delivered into the cell cytoplasm. Additionally, due to their hydrophobicity, small 
organic molecules may be directly administered intracellularly. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 

25 achieve the intended purpose. Determination of the effective amounts is well within 
the capability of those skilled in the art, especially in light of the detailed disclosure 
provided herein. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically acceptable carriers comprising excipients and 
30 auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutically. The preparations formulated for oral 
administration may be in the form of tablets, dragees, capsules, or solutions. 
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including those formulated for delayed release or only to be released when the 
pharmaceutical reaches the small or large intestine. 

The pharaiaceutical compositions of the present invention may be 
manufactured in a manner that is itself known, e.g., by means of conventional 
5 mixing, dissolving, granulating, dragee-making, levitating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of the active compoimds in water-soluble form. Additionally, suspensions 
of the active compounds may be prepared as appropriate oily injection suspensions. 

10 Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 
synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Aqueous injection suspensions may contain substances which increase the viscosity 
of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. 
Optionally, the suspension may also contain suitable stabilizers or agents which 

1 5 increase the solubility of the compoimds to allow for the preparation of highly 
concentrated solutions. 

Pharmaceutical preparations for oral use can be obtained by combining the 
active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to 

20 obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as 
sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such 
as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
caiboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 

25 disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, or alginic acid or a salt thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 

30 dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
DyestufTs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound doses. 
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Phannaceutical preparations which can be used orally include push-fit 
capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active 
ingredients in admixture with filler such as lactose, binders such as starches, and/or 
5 lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, 
such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. 



I^ Examples 

As described in the Examples below, we have demonstrated that a single 
nucleotide polymorphism can cause dramatic structural variation in allelic forms of 
human mRNA. The structural differences occur both at and in regions flanking the 

1 5 SNP. The structural differences allow allele-specific cleavage of mRNA with 
oligonucleotides complementary to a site of maximum differential structural 
accessibility that is independent of single base discrimination at the polymorphic 
site. These findings are of particular interest in several respects. First, the results 
illustrate how SNPs can affect mRNA structure, and suggest mRNA-structure- 

20 dependent mechanisms by which SNPs can cause allele-specific biological 

consequences. Further elucidation of the contribution of common genetic variation 
to structural and functional diversity of mRNA will provide insight into fundamental 
mechanisms of human phenotypic variation, and facilitate studies of disease 
susceptibility and drug response (4-7). Second, the results described here caution 

25 researchers to consider the effect of genetic variation in their characterization of 
RNA as a therapeutic target. Lastly, our novel strategy of allelic structure-based 
mRNA targeting may be applied to therapeutic allele-specific RNA down- 
regulation, preferably in the 5*- and 3'-UTR regions, using other types of specific 
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RNA binders such as small molecule inhibitors (30-32) to allow enhanced cellular 
uptake and improved allele discrimination. 

Example 1 

5 In vitro Synthesis and Purification of mRNA. Human cDNAs for PGR 

synthesis of transcription templates were prepared from total RNA of Epstein-Barr 
virus (EB V) transformed lymphoblasts (Coriell Institute) or tumor cell line NCI- 
H292 (American Type Culture collection). The cDNAs were genotyped for SNPs in 
human AARS and RPA70 either by sequencing or by restriction fragment length 

10 polymorphism (RFLP) analysis. A second round cDNA sequencing was performed 
to confirm the nucleotide sequence of AARS and RPA70 cDNAs. Fragments of 
human mRNA alleles (1 13-1000 mers) varying in a single base were obtained by in 
vitro transcription from PGR derived DNA templates using Megashortscript or 
Megascript T7 kits (Ambion). Crude transcripts were either purified on an 8% 

15 denaturing polyacrylamide gel or on a G-50 sephadex spin column (Boehringer 

Mannheim), and the purified mRNA was subsequently dephosphorylated using calf 
intestinal alkaline phosphatase (Pharmacia) and 5 '-end-labeled with ^^P using [y- 
^^P]ATP (DuPont NEN) and T4 polynucleotide kinase (Pharmacia). 

20 Example 2 

Enzymatic Mapping of Allelic mRNA Structures. Partial digestion of 1 
pmol of 5'-^^P-labeled allelic mRNA by nuclease SI or RNase Tl (Pharmacia) was 
carried out at 37°C for 5 min in 10 ^iL buffer containing 10 mM Tris^HCl, pH 7.5, 
100 mM KCl, 5 mM MgCla, and 5 \ig phenol-extracted glycogen. Prior to the 
25 addition of S 1 or Tl , the RNA was renatured in the assaying buffer by heating for 
45 sec at 85**C, then cooling for 5 min at room temperature followed by 1 0 min at 
37°C. The digestion products were fractionated on either a 4, 6 or an 8% denaturing 
polyacrylamide gel, and the cleavage patterns visualized by autoradiography. RNase 
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Tl digestion at 55°C and alkaline hydrolysis at 85 °C were done to generate 
sequence ladders for cleavage site identification. RNase Tl digestion at 55°C was 
done for 1 min in 15 nL buffer containing 25 mM sodium acetate, pH 5.2, 25 mM 
KCl, and 3 jig yeast tRNA. Alkaline hydrolysis was done for 15 min in 15 jxL buffer 
5 containing 50 mM sodium bicarbonate, pH 9.2, 1 mM EDTA, and 1 0 fig phenol- 
extracted glycogen. 

Example 3 

E. coli RNase H Digestion Assay. Phosphorothioate 
1 0 oligodeoxyribonucleotides (PS-oligos) were employed in our studies because of 
their enhanced stability against nucleases (20). PS-oligos were chemically 
synthesized and desalted on C-18 cartridges by Synthetic Genetics. One pmol of 5'- 
^^P-labeled mRNA was renatured in 12 RNase H buffer (20 mM Tris^HCl, pH 
7.5, 10 mM KCl, 5 mM MgC^, 0. 1 mM dithiothreitol (DTT), 5% glycerol, and 10 
15 |ig phenol-extracted glycogen) by heating for 1 min at 85^C, then cooling for 5 min 
at room temperature followed by 10 min at 37°C. E. coli RNase H (0.6 units, 
Pharmacia) was then added to the renatured mRNA and incubated for 5 min at 3TC. 
PS-oligos (5 pmol in RNase H buffer) were subsequently added, and the reaction 
mixture (15 fiL total) incubated for 20 min at 37**C. The digestion products were 
20 fractionated on a 4% denaturing polyacrylamide gel, and the cleavage patterns 

visualized by autoradiography. RNA levels were quantified using a phosphoimager 
(Fujifilm Model FLA.2000). 

Example 4 

25 Oligonucieotide-directed Cleavage of Endogenous RPA70 mRNA in 

Human Cell Extracts. Human tumor cell lines HeLa and Calu-1 (American Type 
Culture collection) were genotyped for SNPs in RPA70 cDNA by RFLP analysis 
and by sequencing. The genotyping results showed that HeLa and Calu-1 express 



wo 00/52210 



43 



PCT/USOO/05271 



only the U- or the C-base at the 1 674U/C polymorphism, respectively. HeLa cells 
were cultured in monolayers in minimum essential medium (Gibco BRL), and Calu- 
1 cells were cultured in monolayers in McCoy's 5A medium (Sigma). Both media 
were supplemented with 10% heat inactivated fetal bovine serum (JRH 
5 Biosciences), 100 units/mL penicillin- 1 00 jig/mL streptomycin (Gibco BRL), and 2 
mM L-glutamine (Gibco BRL). Both cell lines were grown under 5%C02/95% air in 
a himiidified incubator at 37°C. For preparation of cell extracts, the cells were 
grown to 80-90% confluence, typsinized, and washed twice with the complete 
medium as specified above and twice with phosphate buffered saline (Gibco BRL). 

10 Approximately 5x10^ Cells were lysed in a 15 mL tissue grinder with a teflon pestle 
(Wheaton) in the presence of 1.5 mL lysis buffer (20 mM Tris^HCl, pH 7.5, 420 
mM NaCl, 2 mM MgCb, 0.2 mM EDTA, 1 mM DTT, 0.5 mM phenybnethyl 
sulfonylfluoride (PMSF), and 25% glycerol). The sodium salt concentration was 
chosen to facilitate the disruption of the nuclear membrane to release RNase H from 

15 nuclei (21). Up to 200 strokes were needed to completely lyse the cells, as checked 
using a hemacytometer. The cell lysate was centrifuged at 14,000 rpm for 10 min at 
4°C, and cell extracts were obtained by transferring supernatant to clean 
microcentrifuge tubes and freezing immediately at -SO'^C. Total protein 
concentration of cell extracts was determined using the Bradford assay (Bio-Rad). 

20 For oligodeoxynucleotide-directed cleavage of endogenous mRNA reactions, cell 
extracts were diluted to a total protein concentration of 3 mg/mL with the lysis 
buffer. After the addition of a 1 OX PS-oligo solution to the diluted cell extracts, the 
reaction mixture was incubated for 40 min at 37''C. The level of intact RPA70 
mRNA was determined by total RNA isolation and Northern blot hybridization 

25 using procedures described previously (22, 23), and followed by phosphoimager 
analysis. Northern probes specific for RPA70 mRNA were synthesized by random 
primed [a- PJdCTP-labeling of a cDNA probe corresponding to. nucleotides 1519- 
2080 of human RPA70 mRNA. To account for variability in the amoimt of total 
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RNA loaded into individual gel wells for Northern analysis, the level of 
glyceraldehyde 3 -phosphate dehydrogenase (GAPDH) mRNA was also measured by 
first stripping the RPA70 mRNA probe, and then reprobing the same Northern blot 
with a random primed [a-^^P]dCTP-labeled cDNA probe corresponding to a 600 
5 nucleotide sequence of human GAPDH (Stratagene). 

Example 5 

Characterization of SNP-containing mRNA Structures. Two SNPs in the 
coding regions of two hiunan mRNAs were analyzed in this study: a U/C transition 

10 at nucleotide 1013 of human alanyl tRNA synthetase (AARS, GenBank Accession # 
D32050)) and a U/C transition at nucleotide 1674 of human replication protein A, 70 
kDa subunit (RPA70, GenBank Accession # M63488)). To investigate effects of 
these SNPs on mRNA structure, we first synthesized both allelic SNP-containing 
fragments of the two mRNAs by in vitro transcription and then performed mRNA 

15 structural mapping using structure-specific enzymes RNase Tl and/or nuclease SI . 

Nuclease S 1 cleaves both DNA and RNA in single-stranded regions (24, 25). 
It is a useful structural probe for identifying large unpaired accessible regions (> 4 
nucleotides) in RNA. Fig. la shows nuclease SI mapping of a 113-nucleotide 
fragment of AARS mRNA (nucleotides 951-1063). Differences in SI cleavage 

20 patterns were observed between the U- and the C-alleles. Strong cleavages centered 
at nucleotide 999 of the U-allele revealed the presence of a single-stranded region 
approximately 14 nucleotides upstream of the SNP. The majority of the species of 
the C-allele lack such a single-stranded region, as shown by the absence of strong 
cleavage bands in the same region. Allelic differences in SI accessibility were also 

25 observed at several other sites including regions 980 and 1 032 (Fig. 1 a). At higher 
SI concentrations both alleles exhibited similar cleavage patterns at sites 980 and 
1032, suggesting that the stronger cleavage intensities at these sites in the C-allele 
may be due partly to the absence of a large unpaired region at position 999. 
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Structural mapping by RNase Tl (data not shown), which identifies unpaired 
guanosine residues (25), corroborated the finding of substantial allelic structural 
differences within ±14 nucleotides of the SNP. The enzymatic mapping experiments 
were also done on a 1000-nucleotide fragment (nucleotides 529-1528) of the AARS 
5 mRNA. Differential S 1 digestion patterns of the two alleles identical to those at sites 
980, 999 and 1032 in Fig. la were observed with the 1000-mer mRNA sequence 
(data not shown). 

Pronounced allelic structural differences caused by a single nucleotide 
variation in the mRNA sequence were also detected in a 544-nucleotide fragment of 
10 RPA70 mRNA (Fig. lb). The fragment, consisting of nucleotides 1378-1921, was 
centered about the 1674U/C polymorphism. Intense SI cleavages were detected in 
the U-allele around nucleotide 1656, indicating the presence of a single-stranded 
region approximately 18 nucleotides upstream of the SNP. In contrast, no cleavage 
above background intensity was detected at the same site of the C-allele, Differential 

15 SI cleavage between the two alleles was also observed at nucleotide 1664 where the 
U-allele was more weakly cleaved than the C-allele. These data thus defined a 
differential-structure motif between the two allelic forms of RPA70 mRNA, with a 
maximum allelic difference in SI accessibility in the region of nucleotide 1656. In 
addition, the structural mapping data on both the RPA70 and the AARS mRNAs 

20 revealed that the polymorphic sites were relatively inaccessible by single-strand- 
specific nuclease SI. Proposed secondary structiu-al models of the polymorphic 
regions of the pairs of mRNA alleles are shown in Fig. 2. 

Since the SNP-dependent allelic structural differences described above were 
identified using mRNA fragments in a purified system, it is critical to examine 

25 whether our. findings of allele-specific structural features exist in the context of full- 
length mRNA in a protein-rich environment mimicking the physiological conditions 
in cells. To address this issue, we used a structure-based mRNA targeting strategy to 
monitor allelic structural accessibility of RPA70 mRNA in two different assay 
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systems: a purified system containing niRNA fragments and a cell-mimicking 
system containing endogenous human RPA70 mRNA. We designed a series of 
phosphorothioate oligodeoxyribonucleotides (PS-oIigos) complementary to the 1656 
region of RPA70 mRNA that contains the maximuni allelic difference in structural 
5 accessibiUty as identified by SI mapping. The target site at the region of 1656 was 
chosen to span 20 or 15 nucleotides that do not include the 1674U/C polymorphism. 
Thus, the target site was identical in nucleotide sequence between the U- and C- . 
alleles, and was predicted based on S 1 mapping results to be accessible for PS-oligo 
binding in the U-allele, and inaccessible in the C-allele. Exogenous or endogenous 
10 RNase H, which cleaves RNA in RNA^DNA hybrids (26), was used to monitor the 
targeting efficiencies of PS-ohgos for the two alleles of RPA70 mRNA. 

Example 6 

Allele-specific Cleavage of RPA70 mRNA in a Purified System. We first 
15 used an in vitro assay with E. coli RNase H and the in vitro transcribed 544-mer 
RPA70 mRNA fragments to examine the efficiency and allele specificity of 
structure-based targeting as compared to directly targeting the 1674U/C 
polymorphism. Fig. 3a shows nucleotide sequences of the PS-oligos designed to 
target the RPA70 polymorphism (1, U-allele specific; 2, C-allele specific) or the site 
20 of maximum differential structural accessibility in the 1656 region (3-6). SNP- 

targeting oligos 1 and 2 were previously optimized for allele discrimination in a cell 
transfection assay (J. P. Basilion, unpublished results). Each of the SNP-targeting 
oligos is perfectly matched only with one of the two alleles; it is singly mismatched 
with the other allele. Single base mismatches formed between the SNP-targeting 
25 oligos and their respective non-targeting alleles contribute to the allele 

discrimination observed previously. Structure-targeting oligos 3 and 6 match 
perfectly with both alleles, and were designed to hybridize maximally with the 
single-stranded 1656 region of the U-allele, while avoiding the 1664 region which is 
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slightly accessible in both alleles (Fig. lb). Four and six additional nucleotides were 
added to the 5 '-end of oligo 3 to produce oligos 4 and 5, respectively, to facilitate 
folding into hairpin structures for enhanced allele discrimination (27). 

Results of oligo-directed, site-specific cleavage of the 544-mer U- and C- 
5 alleles by £. coli RNase H are shovm in Fig. 3b-c. As predicted, the U-allele that 
was more sensitive to SI cleavage than the C-allele in the 1656 region was over 10- 
fold more efficiently cleaved by E. coli RNase H in the presence of structure- 
targeting PS-ohgos 3-6. Also as expected, the cleavage efficiencies with PS-oligos 1 
and 2 that target the SNP at position 1674 were greater for their respective matched 

10 alleles than for the mismatched alleles. Moreover, the oligos targeting the 

polymorphic site were much less effective in inducing target cleavage than the 
oligos targeting the single-stranded-structure site of the U-allele: an approximately 
14- to 25-fold reduction in cleavage efficiency was observed. This is consistent with 
the polymorphic site being less accessible than the single-stranded 1656 region in 

1 5 the U-allele, as identified by S 1 mapping. 

Oligos 4 and 5 with predicted hairpin structures exhibited a 23% or 36% 
reduction, respectively, in the cleavage of both alleles as compared to their parent 
oligo 3. Therefore, no enhancement in allele discrimination was obtained with oligos 
4 and 5 under the experimental conditions. 

20 

Example 7 

AUele-specific Cleavage of EDdogenous RPA70 mRNA in Human Cell 
Extracts. We next performed oligo-directed cleavage of the full-length, endogenous 
RPA70 mRNA in whole cell extracts prepared firom two genotyped human cell 
25 lines, HeLa (1674U) and Calu-l (1674C). Cell lysis conditions were chosen to allow 
the disruption of both the plasma and the nuclear membranes in the absence of 
denaturants deleterious to the integrity of cellular proteins and nucleic acids. In order 
to evaluate variations between the two different cell types, we introduced a non- 
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allele-specific, 20-mer PS-oligo 7 to the assay. Oligo 7 is complementary to the 
2230-2249 sequence region of the 3 '-UTR of human RPA70 mRNA; it was 
previously shown to target the 1674U- and 1674C-alleles with equally high 
efficiency in a cell transfection assay (J. P. Basilion, unpubhshed results). 
5 Fig. 4 compares the extent of site-specific cleavage of the 1674U- and 

1674C-alleles of the endogenous human RPA70 mRNA in the presence of 100, 300 
and 600 nM of either U- or C-allele-specific oligos 1, 2, 3 and 6, or oligo 7 that 
targets both alleles. Both the full-length and one or two cleavage products were 
detected by Northem blot hybridization using [a-^^P]dCTP-labeled cDNA probes 

1 0 complementary to nucleotides 1519-2080 of RP A70 mRNA. Since no E. cqli RNase 
H was added to the reaction mixture, the site-specific cleavage of the mRNA is 
attributable to the presence of endogenous human RNase H in the cell extracts. 
Quantitative analysis of the extent of mRNA cleavage was done by integrating the 
fiiU-length mRNA signal instead of the cleavage products. The latter did not 

1 5 hybridize with probes to the same extent because of probe design and were also 
likely more susceptible to inhomogeneous degradation by RNases in the cell 
extracts. The variation in sample loading was accounted for by probing a non- 
targeted message, glyceraldehyde 3-phosphate dehydrogenase (GAPDH), in the 
same Northem blots. 

20 The two cell types responded similarly to the addition of control oligo 7 over 

a concentration range of 100 to 600 nM (Fig. 4). In contrast, oligos 1, 2, 3 and 6 
displayed various degrees of targeting efficiency in an allele-specific pattern similar 
qualitatively to the results obtained in the purified system. In the presence of 300 
nM of 20-mer 1 or 18-mer 2 targeting the 1674U/C polymorphic site, the majority of 

25 the matched allele remained uncleaved, and the matched allele was equally or 

slightly more cleaved than the singly mismatched allele (Fig. 4a). These results can 
be explained by the inaccessibility of the 1674U/C polymorphic site as identified by 
in vitro structural mapping, and by the predicted small difference in AG** of duplex 
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formation (ca. 1-2 kcal mol"') (28, 29) between the matched and singly mismatched 
heteroduplexes. On the other hand, in the presence of 300 nM 20-mer 3 that targets 
the single-stranded structural motif of the U-allele, only 15% of the U-allele 
remained uncleaved, while more than 90% of the non-target C-allele was intact (Fig. 
5 4b). Such high efliciency and specificity of U-allele targeting by 3 is consistent with 
an accessible single-stranded structure in the 1656 region of the U-allele and an 
inaccessible structure in the same region of the C-allele (see Figs, lb, 2b). Based on 
the data in Fig. 4, we conclude that under the experimental conditions, the allele 
discrimination obtained by targeting the different allelic structures of RPA70 mRNA 
1 0 with 20-mer 3 is two- to sixfold greater than that obtained by directly targeting the 
polymorphism with 20-mer 1 when the percentage of intact mRNA is evaluated. The 
1 5-mer oUgo 6 also showed better allele discrimination than the SNP-targeting 
oligos 1 and 2 at all three oligo concentrations, albeit less prominent than that of 20- 
mer 3. 

15 It is important to note that the minor allele discrimination observed with the 

SNP-targeting oligos stemmed fi-om the difference in AG"" of duplex formation 
between the matched and singly mismatched heteroduplexes, as well as the 
differential RNase H activities associated with the matched and singly mismatched 
duplexes. In contrast, complete complementarity of the structure-targeting oligos 

20 with their target site sequences is expected for both the target and non-target alleles, 
because these target sites do not contain the polymorphism. Thus, the greater allele 
discrimination by structure-targeting oligos reflects a significant difference in the 
target site accessibility between the two alleles. This accessibility difference could 
be caused by either different structural folds of the two alleles of endogenous 

25 RPA70 mRNA, or different contacts with cellular components, or both effects. The 
positive correlation we have observed between the results in the purified system and 
those in cell extracts strongly support the argument that the pronoimced allele 
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discrimination obtained with the structure-targeting oligos is caused mainly by 
structural differences between the two endogenous RPA70 mRNA alleles. 
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All patents and publications mentioned in the specification are indicative of 
the levels of skill of those skilled in the art to which the invention pertains. All 

1 0 references cited in this disclosure are incorporated by reference to the same extent as 
if each reference had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is 
well adapted to carry out the objects and obtain the ends and advantages mentioned, 
as well as those inherent therein. The methods, variances, and compositions 

15 described herein as presently representative of preferred embodiments are exemplary 
and are not intended as limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art, which are encompassed within 
the spirit of the invention, are defined by the scope of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions 

20 and modifications may be made to the invention disclosed herein without departing 
firom the scope and spirit of the invention. For example, using other compounds, 
and/or methods of administration are all within die scope of the present invention. 
Thus, such additional embodiments are within the scope of the present invention and 
the following claims. 

25 The invention illustratively described herein suitably may be practiced in the 

absence of any element or elements, limitation or limitations which is not 
specifically disclosed herein. Thus, for example, in each instance herein any of the 
terms "comprising", "consisting essentially of and "consisting of may be replaced 
with either of the other two terms. The terms and expressions which have been 

30 employed are used as terms of description and not of limitation, and there is no 

intention that in the use of such terms and expressions of excluding any equivalents 
of the features shown and described or portions thereof, but it is recognized that 
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various modifications are possible within the scope of the invention claimed. Thus, 
it should be understood that although the present invention has been specifically 
disclosed by preferred embodiments and optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the 
5 art, and that such modifications and variations are considered to be within the scope 
of this invention as defined by the appended claims. 

In addition, where features or aspects of the invention are described in terms 
of Markush groups or other grouping of alternatives, those skilled in the art will 
recognize that the invention is also thereby described in terms of any individual 
10 member or subgroup of members of the Markush group or other group. 

Thus, additional embodiments are within the scope of the invention and 
within the following claims. 
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CLAIMS 

What is claimed is: 

1 . A method for identifying a potential small molecule inhibitor which inhibits. 
5 an activity of a cellular RNA by inducing formation of a complex with a said RNA 

and a cellular protein, wherein said potential small molecule inhibitor comprises a 
first moiety and a second moiety and a linker joining said first moiety and said 
second moiety, said method comprising: 

a) identifying a said first moiety which binds to at least a portion of a 
10 selected cellular protein; and 

b) identifying a said second moiety which binds to at least a fi*agment of a 
target cellular RNA; 

wherein a small molecule consisting essentially of said first moiety and said 
second moiety and said linker connecting said first and second moieties is a said 
1 5 potential small molecule inhibitor. 

2. The method of claim 1, wherein said identifying a said first moiety 
comprises identifying a known small molecule ligand of said cellular protein. 

20 3. The method of claim 2, fiirther comprising 

providing a small molecule library containing compoxmds consisting 
essentially of said first moiety and said linker attached to variable second moeities; 
and 

screening said library to identify one or more molecules wherein said second 
moiety binds to said target RNA. 

4. The method of claim 1 , wherein said identifying a said first moiety 
comprises: 

providing a small molecule library of compounds consisting essentially of a 
variable first moiety, a linker, and a fixed second moiety; and 

screening compounds of said library to identify a compound which binds to 
at least a portion of said selected cellular protein. 
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5. The method of claim 4, wherein said identifying a said second moiety 
comprises 

providing a small molecule library consisting essentially of a first moiety 
which binds to said selected cellular protein, a linker, and a variable second moiety; 
5 and 

screening compounds of said library to identify a compound which binds to 
said target RNA. 

6. The method of claim 2, wherein said selected cellular protein and its ligand 
10 are selected from the group consisting of adenosine triphosphate analogs and cellular 

ATPases; flavin adenine dinucleotide or FAD analogs; nicotinamide or nicotinamide 
analogs; folates or folate analogs and dihydrofolate reductase, thymidylate synthase 
or other abundant proteins which require folate as a cofactor; cyclosporin, FK506, 
rapamycin, coumermycin or their synthetic analogs and FK506 Binding Protein. 

15 

7. The method of claim 1 wherein two or more iterations of at least one of said 
identifications are performed to improve the characteristics of the small molecule. 

8. The method of claim 1, wherein the protein constituent of said complex is a 
20 cellular protein that is present in the cellular compartment(s) in which the target 

RNA is present. 

9. The method of claim 1, wherein the protein constituent of said complex is a 
protein with known RNA binding activity. 

25 

10. The method of claim 1, wherein the protein constituent of said complex is an 
RNase. 

11. The method of claim 1, wherein the protein constituent of said complex is an 
30 RNase fi-om the class RNaseH type 1, RNaseH type 2, and RNaseL. 

12. The method of claim 1, wherein the protein constituent of said complex is an 
RNA helicase. 
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25 



30 



13. The method of claim 1 , wherein the protein constituent of said complex is an 
DNA binding protein. 



5 1 4. The method of claim 1 , wherein the protein constituent of said complex has a 
two or more basic residues within 10 Angstroms of the small molecule binding site 
thereby providing favorable charge interactions with said target RNA. 

15. The method of claim 1, wherein said small molecules do not exceed 4000 
10 Daltons. 



16. The method of claim 1, wherein said small molecules do not exceed 2000 
Daltons 



15 17. The method of claim 1 , wherein said small molecules do not exceed 1 000 
Daltons 



18. The method of claim 1 , wherein the targeted cellular RNA is a pre-mRNA. 

20 19. The method of claim 1, wherein the targeted cellular RNA is an mRNA. 

20. The method of claim 1 , wherein said targeted cellular RNA has a 
polymorphism that results in an ahered secondary structure and the altered 
secondary structure is the target of the RNA binding moiety of said small molecule. 



2 1 . The method of claim 1 , wherein said targeted cellular RNA is an RNA 
bearing a mutation that alters secondary structure of said RNA, or that exists on the 
same allele as another site of variation that alters the secondary structure of said 
RNA. 



22. The method of claim 1, further comprising 
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providing a plurality of small molecules comprising a plurality of different 
said linkers and a first moiety identified in a), a second moiety identified in b), or 
both; and 

screening said plurality of small molecules to identify a said potential small 
5 molecule inhibitor with advantageous binding or pharmacologic characteristics. 

23. The method of claim 1, wherein said identifying a said second moiety 
comprises 

providing a small molecule library containing compoimds consisting 
1 0 essentially of a fixed said first moiety and said linker attached to variable second 
moieties; and 

screening said library to identify one or more molecules wherein said second 
moiety binds to said target RNA. 

1 5 24. The method of claim 23, wherein said identifying a said first moiety 
comprises: 

providing a small molecule library of compounds consisting essentially of a 
variable first moiety, a linker, and a fixed second moiety identified in claim 23; and 
screening compounds of said library to identify a compound which binds to 
20 said selected cellular protein or portion thereof. 



25. A method for identifying an RNA target for an allele specific 
therapeutic small molecule comprising: 
25 a) identifying at least one sequence variance in said RNA target in a 

population of interest: 

b) determining whether the RNA secondary structure of any sequence 
variants identified differs between variant alleles; 

wherein a difference in said RNA secondary structure is indicative that said 
30 sequence variance is a potential target for a said allele specific therapeutic small 
molecule. 
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26. The method of claim 25, wherein the sequence variance to be targeted occurs 
in a frequency range between 0. 1 :0.9 and 0.5:0,5 in a population of interest. 

27. The method of claim 25, wherein the determination of said RNA secondary 

5 structure is performed using a nuclease selected from the group consisting of Tl, T2, 
SI, U2, CL3, VI, A, PhyM, N.c. nuclease or RNase, or by chemical probing with a 
chemical selected from the group conssting of dimethylsulfate, 
diethylpyrocarbonate, CMCT, kethoxal, bisulfite, ethylnitrosourea, MPE-Fe(II), and 
Fe(II)-EDTA. 

10 

28. The method of claim 25,wherein said target RNA comprises an allele that, if 
reduced in abundance, would at least ameliorate a disease state. 

29. The method of claim 25, wherein said target RNA comprises an allele 
1 5 associated with an autosomal dominant disease. 

30. The method of claim 29, wherein said autosomal dominant disease is 
selected from the group consisting of Huntington's disease, cervical vertebral fusion, 
keratitis-ichthyosis-deafhess syndrome, progressive external ophthahnoplegia, type 

20 3, progressive extemal ophthalmoplegia, type 2, spastic paraplegia 6, progressive 
external ophthalmoplegia, pituitary dwarfism due to isolated growth hormone 
deficiency, distal renal tubular acidosis, vitamin D-resistant rickets, 
scapuloilioperoneal atrophy with cardiopathy, spastic paraplegia 4, spastic 
paraplegia 3, type II osteopetrosis, lamellar ichthyosis, nonsyndromic sensorineural 

25 3 deafness, iridogoniodysgenesis, type 2, spinocerebellar ataxia 7, microcephaly, 
torsion dystonia 1 , hereditary multi-infarct type dementia, pseudoxanthoma 
elasticum, autosomal dominant Lewy body in Parkinson disease, autosomal 
dominant nonsyndromic sensorineural 2 deafiiess, hypertelorism with esophageal 
abnormality and hypospadias, microcephaly with chorioretinopathy, diamond- 

30 blackfan anemia, hyperinsulinism, ectodermal dysplasia 3, nonsyndromic 
sensorineural 8 deafness, Larsen syndrome, hypoplastic local amelogenesis 
imperfecta 2, polycystic kidney disease 3, congenital nystagmus 2, Ehlers-Danlos 
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syndrome, type IV, mitochondrial DNA breakage syndrome secondary to nuclear 
mutation, and retinal cone dystrophy 2. 

31. The method of claim 25, wherein said target RNA comprises an allele of an 
5 essential gene which undergoes loss of heterozyosity in a proliferative disorder. 

32. The method of claim 25. wherein said allele specific small molecule 
comprises a small molecule inhibitor comprising a first moiety which binds to a 
selected cellular protein or a portion thereof; a second moiety which binds to a target 

1 0 cellular RNA or a fragment thereof, and a linker connecting said first moiety and 
said second moiety, wherein said small molecule inhibitor can bind to said selected 
cellular protein and to said target cellular RNA to form a protein-small molecule- 
RNA complex. 

15 

33. A method for identifying an allele specific RNA binding small molecule for 
a target RNA, comprising: 

a) identifying at least one sequence variance in said RNA target in a 
population of interest having secondary structure differences between variant alleles; 
20 b) screening the two RNA structures for a said sequence variance or 

variances for binding to or inhibition by a plurality of small molecules; and 

c) identifying a compoimd that preferentially binds to or inhibits one of said 
RNA structures, wherein said binding or inhibition is indicative that said compound 
is a said allele specific RNA binding small molecule. 

.34. The method of claim 33, wherein said plurahty of small molecules are 
elements of a small molecule combinatorial library. 

35. The method of claim 33, further comprising determining whether said allele 
specific RNA binding small molecule provides a cellular activity, comprising 

contacting cells producing said target RNA with said allele specific RNA 
binding small molecule; and 
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determining whether said small molecule produces a change in a cellular 
function. 

36. The method of claim 35, wherein said cellular function is a decrease in 
5 expression of said target RNA. 

37. The method of claim 33, wherein said target RNA is a pre-mRNA or an 
mRNA. 



10 38, The method of claim 33, wherem said plurality of small molecules comprises 
a plurality of potential small molecule inhibitors each comprising a first moiety 
which may be the same or different and a second moiety which differs between said 
plurality of small molecules and a linker joining said first moiety and said second 
moiety, wherein said first moiety is selected to bind to a selected cellular protein; 

1 5 and said second moiety is selected to bind to a target cellular RNA, thereby fonning 
a trimolecular complex. 

39. The method of claim 38, wherein said first moiety is a known small molecule 
hgand of said cellular protein or an analog or derivative thereof 



40. The method of claim 39, wherein said selected cellular protein and its ligand 
are selected from the group consisting of adenosine triphosphate analogs and cellular 
ATPases; flavin adenine dinucleotide or FAD analogs; nicotinamide or nicotinamide 
analogs; folates or folate analogs and dihydrofolate reductase, thymidylate synthase 
or other abundant proteins which require folate as a cofactor; cyclosporin, FK506, 
rapamycin, coumermycin or their synthetic analogs and FK506 Binding Protein. 

41 . The method of claim 38, wherein the protein constituent of the trimolecular 
complex is a cellular protein that is present in the cellular compartment(s) in which 
the target RNA is present. 

42. The method of claim 38, wherein the protein constituent of the trimolecular 
complex is a protein with known RNA binding activity. 
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20 



43. The method of claim 38, wherein the protein constituent of the trimolecular 
complex is an RNase. 



5 44. The method of claim 43, wherein the protein constituent of the trimolecular 
complex is an RNase from the class RNaseH type 1 , RNaseH type 2, and RNaseL. 

45. The method of claim 38, wherein the protein constituent of the trimolecular 
complex is an RNA helicase. 

10 

46. The method of claim 38, wherein the protein constituent of the trimolecular 
complex is an DNA binding protein. 

47. The method of claim 38, wherein the protein constituent of the trimolecular 
15 complex has a two or more basic residues within 10 Angstroms of the small 

molecule binding site thereby providing favorable charge interactions with said 
target RNA. 



48. The method of claim 38, wherein the targeted cellular RNA is a pre-mRNA. 



49. The method of claim 38, wherein the targeted cellular RNA is an mRNA. 



50. A pharmaceutical composition comprising 

25 a synthetic small molecule inhibitor comprising a first moiety which binds to 

a selected cellular protein; a second moiety which binds to a target cellular RNA, 
and a linker connecting said first moiety and said second moiety, wherein said small 
molecule inhibitor can bind to said selected cellular protein and to said target RNA 
to form a complex thereby inhibiting an activity of said target RNA; and 

30 a pharmaceutically acceptable carrier, excipient, or diluent. 

5 1 . The composition of claim 50, wherein said selected protein is a human 
protein. 
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52. The composition of claim 50, wherein said target RNA is a human RNA, 

53. The composition of claim 50, wherein said small molecule inhibitor inhibits 
5 expression of said target RNA. 

54. The composition of claim SO, wherein said first moiety is a natural ligand of 
said selected protein or an analog thereof. 

10 55. The composition of claim 50, wherein said small molecule inhibitor has 
differential activity on variant forms of said RNA differing in secondary structure, 
wherein said difference in secondary structure is related to at least one sequence 
variance. 



15 56. The composition of claim 50, wherein the protein constituent of the complex 
is a cellular protein that is present in the cellular compartment(s) in which the target 
RNA is present. 

57. The composition of claim 50, wherein the protein constituent of the complex 
20 is a protein with known RNA binding activity. 

58. The composition of claim 50, wherein the protein constituent of the complex 
is an RNase. 



59. The composition of claim 50, wherein the protein constituent of the complex 
is an RNase from the class RNaseH type 1, RNaseH type 2, and RNaseL. 

60. The composition of claim 50, wherein the protein constituent of the complex 
is an RNA helicase. 

61. The composition of claim 5 0, wherein the protein constituent of the complex 
is an DNA binditig protein. 
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62. The composition of claim 50, wherein the protein constituent of the complex 
has a two or more basic residues within 10 Angstroms of the small molecule binding 
site thereby providing favorable charge interactions with said target RNA. 

5 63. The composition of claim 50, wherein said small molecules do not exceed 
4000 Daltons. 

64. The composition of claim 50, wherein the targeted cellular RNA is a pre- 
mRNA. 

10 

65. The composition of claim 50, wherein the targeted cellular RNA is an 
mRNA. 

66. The composition of claim 50, wherein said targeted cellular RNA has a 
15 polymorphism that results in an altered secondary structure and the altered 

secondary structure is the target of the RNA binding moiety of said small molecule. 

67. The composition of claim 50, wherein said targeted cellular RNA is an RNA 
bearing a mutation that alters secondary structure of said RNA, or that exists on the 

20 same allele as another site of variation that alters the secondary structure of said 
RNA. 

68. A method for inhibiting expression of an mRNA, comprising 
a) contacting cells normally expressing said mRNA with a small molecule 

inhibitor, said inhibitor comprising 

a first moiety which binds to a selected cellular protein or a portion thereof; a 
second moiety which binds to a target cellular mRNA, and a linker connecting said 
first moiety and said second moiety, wherein said small molecule inhibitor can bind 
to said selected cellular protein and to said target cellular mRNA to form a complex 
thereby inhibiting an activity of said target RNA. 



69. The method of claim 68, wherein said selected protein is a human protein. 
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70. The method of claim 68, wherein said target RNA is a human RNA. 

71 . The method of claim 68, wherein said small molecule inhibitor inhibits 
5 expression of said target RNA. 

72. The method of claim 68, wherein said first moiety is a natural ligand of said 
selected protein or an analog thereof. 

10 73. The method of claim 68, wherein said small molecule inhibitor has 

differential activity on variant forms of said RNA differing in secondary structure, 

• * 

wherein said difference in secondary structure is related to at least one sequence 
variance. 

15 74. The method of claim 68, wherein the protein constituent of the complex is a 
cellular protein that is present in the cellular compartment(s) in which the target 
RNA is present. 

75. The method of claim 68, wherein the protein constituent of the complex is a 
20 protein with known RNA binding activity. 

76. The method of claim 68, wherein the protein constituent of the complex is an 
RNase. 

25 77. The method of claim 68, wherein the protein constituent of the complex is an 
RNase from the class RNaseH type 1 , RNaseH type 2, and RnaseL. 

78. The method of claim 68, wherein the protein constituent of the complex is an 
RNA helicase. 

30 



79. The method of claim 68, wherein the protein constituent of the complex is an 
DNA binding protein. 
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80. The method of claim 68, wherein the protein constituent of the complex has 
a two or more basic residues within 10 Angstroms of the small molecule binding site 
thereby providing favorable charge interactions with said target RNA. 

5 81 . The method of claim 68, wherein said small molecules do not exceed 4000 
Daltons. 

82. The method of claim 68, wherein the targeted cellular RNA is a pre-mRNA. 
10 83 . The method of claim 68, wherein the targeted cellular RNA is an mRNA. 

84. The method of claim 68, wherein said targeted cellular RNA has a 
polymorphism that results in an altered secondary structure and the altered 
secondary structure is the target of the RNA binding moiety of said small molecule. 

15 

85. The method of claim 68, wherein said targeted cellular RNA is an RNA 
bearing a mutation that alters secondary structure of said RNA, or that exists on the 
same allele as another site of variation that alters the secondary structure of said 
RNA. 

20 

86. A method for treating an autosomal dominant or gene dosage disease, 
comprising 

administering to a patient suffering from a said disease a therapeutically 
effective amount of an allele specific RNA inhibitor preferentially active on an 
allelic form of a target RNA in which at least one sequence variance provides a 
difference in secondary structure and wherein the allele specificity involves said 
difference in secondary structure. 

87. The method of claim 86, wherein said disease is an autosomal dominant 
disease, and said patient is heterozygous for the gene corresponding to said target 
RNA. 
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88. ' The method of claim 86, wherein said disease is a gene dosage disease, and 
said administering is adjusted to reduce but not eliminate expression of said gene. 

89. The method of claim 88, wherein said patient is heterozygous for said gene, 
5 and said inhibitor preferentially reduces or eliminates expression of only one allele. 

90. The method of claim 86, wherein said inhibitor is a small molecule 
comprising 

a first moiety which binds to a selected cellular protein or a portion thereof; a 
10 second moiety which binds to a target cellular RNA or a firagment thereof, and a 
linker connecting said first moiety and said second moiety, wherein said small 
molecule inhibitor can bind to a selected cellular protein and to said target RNA to 
form a complex, thereby inhibiting an activity of said target RNA. 

15 

91. A method of treating a patient suffering fcom a disease or condition, 
comprising 

administering to said patient an allele specific inhibitor of a target RNA 
wherein the allele specificity involves a difference in secondary structure related to 
20 at least one sequence variance. 

92. A method for producing a pharmaceutical agent, comprising 
identifying a small molecule inhibitor of a cellular RNA, wherein said small 

25 molecule inhibitor comprises a first moiety and a second moiety and a linker joining 
said first moiety and said second moiety and said inhibitor will form a complex with 
a selected cellular protein which binds with said first moiety and said cellular RNA, 
said method comprising: 

a) identifying a said first moiety which binds to a selected cellular protein or 
30 a portion thereof; and 

b) identifying a said second moiety which binds to a target cellular RNA or 
a fragment thereof; 
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c) synthesizing said small molecule inhibitor- in an amount sufficient to 
provide a therapeutic response in a patient. 

93. The method of claim 92, further comprising selecting a hnker which allows 
5 advantageous complex formation. 

93. The method of claim 92, wherein said inhibitor is preferentially active on an 
allehc form of an mRNA in which at least one sequence variance provides a 
difference in RNA secondary structure. 

10 

94. A synthetic complex-forming molecule comprising 

a first moiety which binds to a selected cellular protein; a second moiety 
which binds to a target cellular RNA, and a linker connecting said first moiety and 
1 5 said second moiety, wherein said small molecule inhibitor can bind to said selected 
cellular protein and to said target cellular RNA to form a complex. 

95. The molecule of claim 94, wherein said molecule will inhibit expression of a 
target mRNA under translation conditions. 

20 

96. The molecule of claim 94, wherein said selected cellular protein is an RNA 
binding protein or a derivative thereof. 
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(1) Hydrocarobon chains of various lengths and chemical composition. For example: 





«U„^*H hydrocarbon ring systems. Points of attachment 

(lUustrated by •) may be at any two different carbon atoms. For example: 



1 ^NH 







3 
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(3) Linkers built from carbohydrates (saccharides). Points of attachment (iUustrated 
by •) may be anywhere on each sugar ring. For example: 
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